METHOD OF DIAGNOSING, 
MONITORING, STAGING, IMAGING AND TREATING PROSTATE CANCER 

FIELD OF THE INVENTION 

This invention relates, in part, to newly developed 
assays for detecting, diagnosing, monitoring, staging, 
prognosticating, imaging and treating cancers, particularly 
prostate cancer. 

BACKGROUND OF THE INVENTION 

Cancer of the prostate is the most prevalent malignancy 
in adult males, excluding skin cancer, and is an increasingly 
prevalent health problem in the United States. In 1996, it 
was estimated that 41,400 deaths would result from this 
disease in the United States alone, indicating that prostate 
cancer is second only to lung cancer as the most common cause 
of death in the same population. If diagnosed and treated 
early, when the cancer is still confined to the prostate, the 
chance of cure is significantly higher. 

Treatment decisions for an individual are linked to the 
stage of prostate cancer present in that individual. A common 
classification of the spread of prostate cancer was developed 
by the American Urological Association (AUA). The AUA system 
divides prostate tumors into four stages, A to D. Stage A, 
microscopic cancer within the prostate, is further subdivided into 
sub-stages A1 and A2. Sub-stage A1 is a well-differentiated 
cancer confined to one site within the prostate. Treatment 
is generally observation, radical prostatectomy, or radiation. 
Sub-stage A2 is a moderately to poorly differentiated cancer 
at multiple sites within the prostate. Treatment is radical 
prostatectomy or radiation. Stage B, palpable lump within the 
prostate, is also further subdivided into sub-stages B1 and 
B2. In sub-stage B1, the cancer forms a small nodule in one 
lobe of the prostate. In sub-stage B2, the cancer forms large 
or multiple nodules, or occurs in both lobes of the prostate. 
Treatment for sub-stages B1 and B2 is either radical 
prostatectomy or radiation. Stage C is a large cancer mass 
involving most or all of the prostate and is also further 
subdivided into two sub-stages. In sub-stage C1, the cancer 
forms a continuous mass that may have extended beyond the 
prostate. In sub-stage C2, the cancer forms a continuous mass 
that invades the surrounding tissue. Treatment for both these 
sub-stages is radiation with or without drugs to address the 
cancer. The fourth stage, Stage D, is metastatic cancer and 
is also subdivided into two sub-stages. In sub-stage D1, the 
cancer appears in the lymph nodes of the pelvis. In sub-stage 
D2, the cancer involves tissues beyond lymph nodes. Treatment 
for both of these sub-stages is systemic drugs to address the 
cancer as well as pain. 

However, current prostate cancer staging methods are 
limited. As many as 50% of prostate cancers initially staged 
as A2, B, or C are actually stage D, metastatic. Discovery 
of metastasis is significant because patients with metastatic 
cancers have a poorer prognosis and require significantly 
different therapy than those with localized cancers. The five 
year survival rates for patients with localized and metastatic 
prostate cancers are 93% and 29%, respectively. 

Accordingly, there is a great need for more sensitive 
and accurate methods for the staging of a cancer in a human 
to determine whether or not such cancer has metastasized and 
for monitoring the progress of a cancer in a human which has 
not metastasized for the onset of metastasis. 

It has now been found that a number of proteins in the 
public domain are useful as diagnostic markers for prostate 
cancer. These diagnostic markers are referred to herein as 
cancer specific genes or CSGs and include, but are not limited 
to: Pro109 which is a human zinc-α2-glycoprotein (Freje et 
al. Genomics 1993 18(3):575-587); Pro112 which is a human 
cysteine-rich protein with a zinc-finger motif (Liebhaber et 
al. Nucleic Acid Research 1990 18(13):3871-3879; WO9514772 and 
WO9845436); Pro111 which is a prostate-specific 
transglutaminase (Dubbink et al. Genomics 1998 51(3):434-444); 
Pro115 which is a novel serine protease with transmembrane, 
LDLR, and SRCR domains and maps to 21q22.3 (Paoloni-Giacobino 
et al. Genomics 1997 44(3):309-320; WO9837418 and WO987093); 
Pro110 which is a human breast carcinoma fatty acid synthase 
(U.S. Patent 5,665,874 and WO9403599); Pro113 which is a 
homeobox gene, HOXB13 (Steinicki et al. J. Invest. Dermatol. 
1998 111:57-63); Pro114 which is a human tetraspan NET-1 
(WO9839446); and Pro118 which is a human JM27 protein 
(WO9845435). ESTs for these CSGs are set forth in SEQ ID NO: 
1, 3, 5, 7, 9, 11, 13 and 15 while the full length contigs for 
these CSGs are set forth in SEQ ID NO:2, 4, 6, 8, 10, 12, 14 
and 16, respectively. Additional CSGs for use in the present 
invention are depicted herein in SEQ ID NO: 17, 18, 19 and 20. 

In the present invention, methods are provided for 
detecting, diagnosing, monitoring, staging, prognosticating, 
imaging and treating prostate cancer via the cancer specific 
genes referred to herein as CSGs. For purposes of the present 
invention, CSG refers, among other things, to native protein 
expressed by the gene comprising a polynucleotide sequence of 
SEQ ID NO:1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 
16, 17, 18, 19 or 20. By "CSG" it is also meant herein 
polynucleotides which, due to degeneracy in genetic coding, 
comprise variations in nucleotide sequence as compared to SEQ 
ID NO: 1-20, but which still encode the same protein. In the 
alternative, CSG as used herein means the 
native mRNA encoded by the gene comprising the polynucleotide 
sequence of SEQ ID NO:1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 
13, 14, 15, 16, 17, 18, 19 or 20, levels of the gene 
comprising the polynucleotide sequence of SEQ ID NO:1, 2, 3, 
4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19 or 
20, or levels of a polynucleotide which is capable of 
hybridizing under stringent conditions to the antisense 
sequence of SEQ ID NO:1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 
13, 14, 15, 16, 17, 18, 19 or 20. 

Other objects, features, advantages and aspects of the 
present invention will become apparent to those of skill in 
the art from the following description. It should be 
understood, however, that the following description and the 
specific examples, while indicating preferred embodiments of 
the invention, are given by way of illustration only. Various 
changes and modifications within the spirit and scope of the 
disclosed invention will become readily apparent to those 
skilled in the art from reading the following description and 
from reading the other parts of the present disclosure. 

SUMMARY OF THE INVENTION 

Toward these ends, and others, it is an object of the 
present invention to provide a method for diagnosing the 
presence of prostate cancer by analyzing for changes in levels 
of CSG in cells, tissues or bodily fluids compared with levels 
of CSG in preferably the same cells, tissues, or bodily fluid 
type of a normal human control, wherein a change in levels of 
CSG in the patient versus the normal human control is 
associated with prostate cancer. 

Further provided is a method of diagnosing metastatic 
prostate cancer in a patient having prostate cancer which is 
not known to have metastasized by identifying a human patient 
suspected of having prostate cancer that has metastasized; 
analyzing a sample of cells, tissues, or bodily fluid from 
such patient for CSG; comparing the CSG levels in such cells, 
tissues, or bodily fluid with levels of CSG in preferably the 
same cells, tissues, or bodily fluid type of a normal human 
control, wherein an increase in CSG levels in the patient 
versus the normal human control is associated with prostate 
cancer which has metastasized. 

Also provided by the invention is a method of staging 
prostate cancer in a human which has such cancer by 
identifying a human patient having such cancer; analyzing a 
sample of cells, tissues, or bodily fluid from such patient 
for CSG; comparing CSG levels in such cells, tissues, or 
bodily fluid with levels of CSG in preferably the same cells, 
tissues, or bodily fluid type of a normal human control 
sample, wherein an increase in CSG levels in the patient 
versus the normal human control is associated with a cancer 
which is progressing and a decrease in the levels of CSG is 
associated with a cancer which is regressing or in remission. 

Further provided is a method of monitoring prostate 
cancer in a human having such cancer for the onset of 
metastasis. The method comprises identifying a human patient 
having such cancer that is not known to have metastasized; 
periodically analyzing a sample of cells, tissues, or bodily 
fluid from such patient for CSG; comparing the CSG levels in 
such cells, tissues, or bodily fluid with levels of CSG in 
preferably the same cells, tissues, or bodily fluid type of 
a normal human control sample, wherein an increase in CSG 
levels in the patient versus the normal human control is 
associated with a cancer which has metastasized. 

Further provided is a method of monitoring the change 
in stage of prostate cancer in a human having such cancer by 
looking at levels of CSG in a human having such cancer. The 
method comprises identifying a human patient having such 
cancer; periodically analyzing a sample of cells, tissues, or 
bodily fluid from such patient for CSG; comparing the CSG 
levels in such cells, tissue, or bodily fluid with levels of 
CSG in preferably the same cells, tissues, or bodily fluid 
type of a normal human control sample, wherein an increase in 
CSG levels in the patient versus the normal human control is 
associated with a cancer which is progressing and a decrease 
in the levels of CSG is associated with a cancer which is 
regressing or in remission. 

Further provided are methods of designing new 
therapeutic agents targeted to a CSG for use in imaging and 
treating prostate cancer. For example, in one embodiment, 
therapeutic agents such as antibodies targeted against CSG or 
fragments of such antibodies can be used to detect or image 
localization of CSG in a patient for the purpose of detecting 
or diagnosing a disease or condition. Such antibodies can be 
polyclonal, monoclonal, or omniclonal or prepared by molecular 
biology techniques. The term "antibody", as used herein and 
throughout the instant specification, is also meant to include 
aptamers and single-stranded oligonucleotides such as those 
derived from an in vitro evolution protocol referred to as 
SELEX and well known to those skilled in the art. Antibodies 
can be labeled with a variety of detectable labels including, 
but not limited to, radioisotopes and paramagnetic metals. 
Therapeutic agents such as antibodies or fragments thereof 
can also be used in the treatment of diseases characterized 
by expression of CSG. In these applications, the antibody can 
be used without or with derivatization to a cytotoxic agent 
such as a radioisotope, enzyme, toxin, drug or a prodrug. 
Other objects, features, advantages and aspects of the 
present invention will become apparent to those of skill in 
the art from the following description. It should be 
understood, however, that the following description and the 
specific examples, while indicating preferred embodiments of 
the invention, are given by way of illustration only. Various 
changes and modifications within the spirit and scope of the 
disclosed invention will become readily apparent to those 
skilled in the art from reading the following description and 
from reading the other parts of the present disclosure. 

DETAILED DESCRIPTION OF THE INVENTION 

The present invention relates to diagnostic assays and 
methods, both quantitative and qualitative, for detecting, 
diagnosing, monitoring, staging and prognosticating cancers 
by comparing levels of CSG in a human patient with those of 
CSG in a normal human control. For purposes of the present 
invention, what is meant by CSG levels is, among other things, 
native protein expressed by the gene comprising a 
polynucleotide sequence of SEQ ID NO:1, 2, 3, 4, 5, 6, 7, 8, 
9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19 or 20. By "CSG" it 
is also meant herein polynucleotides which, due to degeneracy 
in genetic coding, comprise variations in nucleotide sequence 
as compared to SEQ ID NO: 1-20, but which still encode the 
same protein. The native protein being detected may be 
whole, a breakdown product, a complex of molecules or 
chemically modified. In the alternative, 
CSG as used herein means the native mRNA encoded by the gene 
comprising the polynucleotide sequence of SEQ ID NO:1, 2, 3, 
4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19 or 
20, levels of the gene comprising the polynucleotide sequence 
of SEQ ID NO:1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 
15, 16, 17, 18, 19 or 20, or levels of a polynucleotide which 
is capable of hybridizing under stringent conditions to the 
antisense sequence of SEQ ID NO:1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 
11, 12, 13, 14, 15, 16, 17, 18, 19 or 20. Such levels are 
preferably determined in at least one of cells, tissues 
and/or bodily fluids, including determination of normal and 
abnormal levels. Thus, for instance, a diagnostic assay in 
accordance with the invention for diagnosing overexpression 
of CSG protein compared to normal control bodily fluids, 
cells, or tissue samples may be used to diagnose the presence 
of prostate cancer. 

All the methods of the present invention may optionally 
include determining the levels of other cancer markers as well 
as CSG. Other cancer markers, in addition to CSG, useful in 
the present invention will depend on the cancer being tested 
and are known to those of skill in the art. 

Diagnostic Assays 

The present invention provides methods for diagnosing the 
presence of prostate cancer by analyzing for changes in levels 
of CSG in cells, tissues or bodily fluids compared with levels 
of CSG in cells, tissues or bodily fluids of preferably the 
same type from a normal human control, wherein an increase in 
levels of CSG in the patient versus the normal human control 
is associated with the presence of prostate cancer. 

Without limiting the instant invention, typically, for 
a quantitative diagnostic assay a positive result indicating 
the patient being tested has cancer is one in which cells, 
tissues or bodily fluid levels of the cancer marker, such as 
CSG, are at least two times higher, and most preferably are 
at least five times higher, than in preferably the same cells, 
tissues or bodily fluid of a normal human control. 
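By way of illustration only, the fold-change criterion just described can be sketched in a few lines of Python. The function names, the return labels and the assumption that patient and control levels are expressed in the same arbitrary units are hypothetical and are not part of the disclosure; only the two-fold and five-fold thresholds come from the text above.

```python
def fold_change(patient_level: float, control_level: float) -> float:
    """Ratio of the patient's CSG level to the normal human control level."""
    if control_level <= 0:
        raise ValueError("control level must be positive")
    return patient_level / control_level


def classify(patient_level: float, control_level: float,
             positive_fold: float = 2.0, preferred_fold: float = 5.0) -> str:
    """Apply the 'at least two times' / 'at least five times' criteria."""
    ratio = fold_change(patient_level, control_level)
    if ratio >= preferred_fold:
        return "positive (most preferred threshold)"
    if ratio >= positive_fold:
        return "positive"
    return "negative"


# Hypothetical serum levels in arbitrary, matching units.
print(classify(10.0, 4.0))   # ratio 2.5
print(classify(25.0, 4.0))   # ratio 6.25
print(classify(5.0, 4.0))    # ratio 1.25
```

The thresholds are left as parameters because the text presents them as typical rather than limiting values.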

The present invention also provides a method of 
diagnosing metastatic prostate cancer in a patient having 
prostate cancer which has not yet metastasized for the onset 
of metastasis. In the method of the present invention, a 
human cancer patient suspected of having prostate cancer which 
may have metastasized (but which was not previously known to 
have metastasized) is identified. This is accomplished by a 
variety of means known to those of skill in the art. 

In the present invention, determining the presence of CSG 
levels in cells, tissues or bodily fluid is particularly 
useful for discriminating between prostate cancer which has 
not metastasized and prostate cancer which has metastasized. 
Existing techniques have difficulty discriminating between 
prostate cancer which has metastasized and prostate cancer 
which has not metastasized, and proper treatment selection is 
often dependent upon such knowledge. 

In the present invention, the cancer marker levels 
measured in such cells, tissues or bodily fluid are CSG levels, and 
are compared with levels of CSG in preferably the same cells, 
tissue or bodily fluid type of a normal human control. That 
is, if the cancer marker being observed is just CSG in serum, 
this level is preferably compared with the level of CSG in 
serum of a normal human control. An increase in the CSG in 
the patient versus the normal human control is associated with 
prostate cancer which has metastasized. 

Without limiting the instant invention, typically, for 
a quantitative diagnostic assay a positive result indicating 
the cancer in the patient being tested or monitored has 
metastasized is one in which cells, tissues or bodily fluid 
levels of the cancer marker, such as CSG, are at least two 
times higher, and most preferably are at least five times 
higher, than in preferably the same cells, tissues or bodily 
fluid of a normal patient. 

Normal human control as used herein includes a human 
patient without cancer and/or non-cancerous samples from the 
patient; in the methods for diagnosing or monitoring for 
metastasis, normal human control may preferably also include 
samples from a human patient that is determined by reliable 
methods to have prostate cancer which has not metastasized. 

Staging 

The invention also provides a method of staging prostate 
cancer in a human patient. The method comprises identifying 
a human patient having such cancer and analyzing cells, 
tissues or bodily fluid from such human patient for CSG. The 
CSG levels determined in the patient are then compared with 
levels of CSG in preferably the same cells, tissues or bodily 
fluid type of a normal human control, wherein an increase in 
CSG levels in the human patient versus the normal human 
control is associated with a cancer which is progressing and 
a decrease in the levels of CSG (but still increased over true 
normal levels) is associated with a cancer which is regressing 
or in remission. 

Monitoring 

Further provided is a method of monitoring prostate 
cancer in a human patient having such cancer for the onset of 
metastasis. The method comprises identifying a human patient 
having such cancer that is not known to have metastasized; 
periodically analyzing cells, tissues or bodily fluid from 
such human patient for CSG; and comparing the CSG levels 
determined in the human patient with levels of CSG in 
preferably the same cells, tissues or bodily fluid type of a 
normal human control, wherein an increase in CSG levels in the 
human patient versus the normal human control is associated 
with a cancer which has metastasized. In this method, normal 
human control samples may also include prior patient samples. 

Further provided by this invention is a method of 
monitoring the change in stage of prostate cancer in a human 
patient having such cancer. The method comprises identifying 
a human patient having such cancer; periodically analyzing 
cells, tissues or bodily fluid from such human patient for 
CSG; and comparing the CSG levels determined in the human 
patient with levels of CSG in preferably the same cells, 
tissues or bodily fluid type of a normal human control, 
wherein an increase in CSG levels in the human patient versus 
the normal human control is associated with a cancer which is 
progressing in stage and a decrease in the levels of CSG is 
associated with a cancer which is regressing in stage or in 
remission. In this method, normal human control samples may 
also include prior patient samples. 

Monitoring a patient for onset of metastasis is periodic 
and preferably done on a quarterly basis. However, this may 
be more or less frequent depending on the cancer, the 
particular patient, and the stage of the cancer. 

Assay Techniques 
Assay techniques that can be used to determine levels of 
gene expression (including protein levels), such as CSG of the 
present invention, in a sample derived from a patient are well 
known to those of skill in the art. Such assay methods 
include, without limitation, radioimmunoassays, reverse 
transcriptase PCR (RT-PCR) assays, immunohistochemistry 
assays, in situ hybridization assays, competitive-binding 
assays, Western Blot analyses, ELISA assays and proteomic 
approaches: two-dimensional gel electrophoresis (2D 
electrophoresis) and non-gel based approaches such as mass 
spectrometry or protein interaction profiling. Among these, 
ELISAs are frequently preferred to diagnose a gene's expressed 
protein in biological fluids. 

An ELISA assay initially comprises preparing an antibody, 
if not readily available from a commercial source, specific 
to CSG, preferably a monoclonal antibody. In addition a 
reporter antibody generally is prepared which binds 
specifically to CSG. The reporter antibody is attached to a 
detectable reagent such as a radioactive, fluorescent or 
enzymatic reagent, for example horseradish peroxidase enzyme 
or alkaline phosphatase. 

To carry out the ELISA, antibody specific to CSG is 
incubated on a solid support, e.g. a polystyrene dish, that 
binds the antibody. Any free protein binding sites on the 
dish are then covered by incubating with a non-specific 
protein such as bovine serum albumin. Next, the sample to be 
analyzed is incubated in the dish, during which time CSG binds 
to the specific antibody attached to the polystyrene dish. 
Unbound sample is washed out with buffer. A reporter antibody 
specifically directed to CSG and linked to a detectable 
reagent such as horseradish peroxidase is placed in the dish, 
resulting in binding of the reporter antibody to any 
CSG bound to the monoclonal antibody. Unattached reporter 
antibody is then washed out. Reagents for peroxidase 
activity, including a colorimetric substrate, are then added 
to the dish. Immobilized peroxidase, linked to CSG 
antibodies, produces a colored reaction product. The amount 
of color developed in a given time period is proportional to 
the amount of CSG protein present in the sample. Quantitative 
results typically are obtained by reference to a standard 
curve. 
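As a minimal sketch of how a reading may be converted to a quantity "by reference to a standard curve", the following Python example interpolates linearly between neighbouring standards. The standard concentrations, the absorbance values and the function name are hypothetical; the text does not prescribe a curve-fitting method, and other fits (for example a four-parameter logistic) are also in common use.

```python
from bisect import bisect_left


def concentration_from_standard_curve(signal, standards):
    """Interpolate a concentration from a measured signal.

    standards: (concentration, signal) pairs from known standards, with
    signal assumed to rise monotonically with concentration.
    """
    pts = sorted(standards, key=lambda p: p[1])
    signals = [s for _, s in pts]
    if not signals[0] <= signal <= signals[-1]:
        raise ValueError("signal outside the range of the standard curve")
    i = bisect_left(signals, signal)
    if signals[i] == signal:
        return pts[i][0]
    # Linear interpolation between the two bracketing standards.
    (c0, s0), (c1, s1) = pts[i - 1], pts[i]
    return c0 + (c1 - c0) * (signal - s0) / (s1 - s0)


# Hypothetical standards: (concentration, absorbance).
standards = [(0.0, 0.05), (1.0, 0.25), (2.0, 0.45), (4.0, 0.85)]
print(concentration_from_standard_curve(0.35, standards))
```

Readings outside the range spanned by the standards are rejected rather than extrapolated, since the proportionality described above is only established within the calibrated range.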

A competition assay can also be employed wherein 
antibodies specific to CSG are attached to a solid support and 
labeled CSG and a sample derived from the host are passed over 
the solid support. The amount of label detected which is 
attached to the solid support can be correlated to a quantity 
of CSG in the sample. 

Nucleic acid methods can also be used to detect CSG mRNA 
as a marker for prostate cancer. Polymerase chain reaction 
(PCR) and other nucleic acid methods, such as ligase chain 
reaction (LCR) and nucleic acid sequence based amplification 
(NASBA), can be used to detect malignant cells for diagnosis 
and monitoring of various malignancies. For example, reverse-transcriptase 
PCR (RT-PCR) is a powerful technique which can 
be used to detect the presence of a specific mRNA population 
in a complex mixture of thousands of other mRNA species. In 
RT-PCR, an mRNA species is first reverse transcribed to 
complementary DNA (cDNA) with use of the enzyme reverse 
transcriptase; the cDNA is then amplified as in a standard PCR 
reaction. RT-PCR can thus reveal by amplification the 
presence of a single species of mRNA. Accordingly, if the 
mRNA is highly specific for the cell that produces it, RT-PCR 
can be used to identify the presence of a specific type of 
cell. 

Hybridization to clones or oligonucleotides arrayed on 
a solid support (i.e. gridding) can be used to both detect the 
expression of and quantitate the level of expression of that 
gene. In this approach, a cDNA encoding the CSG gene is fixed 
to a substrate. The substrate may be of any suitable type 
including but not limited to glass, nitrocellulose, nylon or 
plastic. At least a portion of the DNA encoding the CSG gene 
is attached to the substrate and then incubated with the 
analyte, which may be RNA or a complementary DNA (cDNA) copy 
of the RNA, isolated from the tissue of interest. 
Hybridization between the substrate bound DNA and the analyte 
can be detected and quantitated by several means including but 
not limited to radioactive labeling or fluorescence labeling 
of the analyte or a secondary molecule designed to detect the 
hybrid. Quantitation of the level of gene expression can be 
done by comparison of the intensity of the signal from the 
analyte compared with that determined from known standards. 
The standards can be obtained by in vitro transcription of the 
target gene, quantitating the yield, and then using that 
material to generate a standard curve. 

Of the proteomic approaches, 2D electrophoresis is a 
technique well known to those in the art. Isolation of 
individual proteins from a sample such as serum is 
accomplished using sequential separation of proteins by 
different characteristics, usually on polyacrylamide gels. 
First, proteins are separated by size using an electric 
current. The current acts uniformly on all proteins, so 
smaller proteins move farther on the gel than larger proteins. 
The second dimension applies a current perpendicular to the 
first and separates proteins not on the basis of size but on 
the specific electric charge carried by each protein. Since 
no two proteins with different sequences are identical on the 
basis of both size and charge, the result of a 2D separation 
is a square gel in which each protein occupies a unique spot. 
Analysis of the spots with chemical or antibody probes, or 
subsequent protein microsequencing, can reveal the relative 
abundance of a given protein and the identity of the proteins 
in the sample. 

The above tests can be carried out on samples derived 
from a variety of cells, bodily fluids and/or tissue extracts 
such as homogenates or solubilized tissue obtained from a 
patient. Tissue extracts are obtained routinely from tissue 
biopsy and autopsy material. Bodily fluids useful in the 
present invention include blood, urine, saliva or any other 
bodily secretion or derivative thereof. By blood it is meant 
to include whole blood, plasma, serum or any derivative of 
blood. 

In Vivo Targeting of CSGs 

Identification of these CSGs is also useful in the 
rational design of new therapeutics for imaging and treating 
cancers, and in particular prostate cancer. For example, in 
one embodiment, antibodies which specifically bind to CSG can 
be raised and used in vivo in patients suspected of suffering 
from prostate cancer. Antibodies which specifically bind a 
CSG can be injected into a patient suspected of having 
prostate cancer for diagnostic and/or therapeutic purposes. 
The preparation and use of antibodies for in vivo diagnosis 
is well known in the art. For example, antibody-chelators 
labeled with Indium-111 have been described for use in the 
radioimmunoscintigraphic imaging of carcinoembryonic antigen 
expressing tumors (Sumerdon et al. Nucl. Med. Biol. 1990 
17:247-254). In particular, these antibody-chelators have 
been used in detecting tumors in patients suspected of having 
recurrent colorectal cancer (Griffin et al. J. Clin. Onc. 1991 
9:631-640). Antibodies with paramagnetic ions as labels for 
use in magnetic resonance imaging have also been described 
(Lauffer, R.B. Magnetic Resonance in Medicine 1991 22:339-342). 
Antibodies directed against CSG can be used in a 
similar manner. Labeled antibodies which specifically bind 
CSG can be injected into patients suspected of having prostate 
cancer for the purpose of diagnosing or staging of the disease 
status of the patient. The label used will be selected in 
accordance with the imaging modality to be used. For example, 
radioactive labels such as Indium-111, Technetium-99m or 
Iodine-131 can be used for planar scans or single photon 
emission computed tomography (SPECT). Positron emitting 
labels such as Fluorine-18 can be used in positron emission 
tomography. Paramagnetic ions such as Gadolinium (III) or 
Manganese (II) can be used in magnetic resonance imaging 
(MRI). Localization of the label permits determination of the 
spread of the cancer. The amount of label within an organ or 
tissue also allows determination of the presence or absence 
of cancer in that organ or tissue. 

For patients diagnosed with prostate cancer, injection 
of an antibody which specifically binds CSG can also have a 
therapeutic benefit. The antibody may exert its therapeutic 
effect alone. Alternatively, the antibody can be conjugated 
to a cytotoxic agent such as a drug, toxin or radionuclide to 
enhance its therapeutic effect. Drug-conjugated monoclonal antibodies 
have been described in the art, for example by Garnett and 
Baldwin, Cancer Research 1986 46:2407-2412. The use of toxins 
conjugated to monoclonal antibodies for the therapy of various 
cancers has also been described by Pastan et al. Cell 1986 
47:641-648. Yttrium-90 labeled monoclonal antibodies have 
been described for maximization of dose delivered to the tumor 
while limiting toxicity to normal tissues (Goodwin and Meares 
Cancer Supplement 1997 80:2675-2680). Other cytotoxic 
radionuclides including, but not limited to, Copper-67, 
Iodine-131 and Rhenium-186 can also be used for labeling of 
antibodies against CSG. 

Antibodies which can be used in these in vivo methods 
include polyclonal, monoclonal and omniclonal antibodies and 
antibodies prepared via molecular biology techniques. 
Antibody fragments and aptamers and single-stranded 
oligonucleotides such as those derived from an in vitro 
evolution protocol referred to as SELEX and well known to 
those skilled in the art can also be used. 

Small molecules predicted via computer imaging to 
specifically bind to regions of CSGs can also be designed and 
synthesized and tested for use in the imaging and treatment 
of prostate cancer. Further, libraries of molecules can be 
screened for potential anticancer agents by assessing the 
ability of the molecule to bind to CSGs identified herein. 
Molecules identified in the library as being capable of 
binding to CSG are key candidates for further evaluation for 
use in the treatment of prostate cancer. 

EXAMPLES 

The present invention is further described by the 
following examples. These examples are provided solely to 
illustrate the invention by reference to specific embodiments. 
These exemplifications, while illustrating certain aspects of 
the invention, do not portray the limitations or circumscribe 
the scope of the disclosed invention. 

All examples outlined here were carried out using 
standard techniques, which are well known and routine to those 
of skill in the art, except where otherwise described in 
detail. Routine molecular biology techniques of the following 
example can be carried out as described in standard laboratory 
manuals, such as Sambrook et al., MOLECULAR CLONING: A 
LABORATORY MANUAL, 2nd Ed.; Cold Spring Harbor Laboratory 
Press, Cold Spring Harbor, N.Y. (1989). 

Example 1: Identification of CSGs 

Identification of CSGs was carried out by a systematic 
analysis of data in the LIFESEQ database available from Incyte 
Pharmaceuticals, Palo Alto, CA, using the data mining Cancer 
Leads Automatic Search Package (CLASP) developed by diaDexus 
LLC, Santa Clara, CA. 

The CLASP performs the following steps: selection of
highly expressed organ specific genes based on the abundance
level of the corresponding EST in the targeted organ versus
all the other organs; analysis of the expression level of each
highly expressed organ specific gene in normal tissue, tumor
tissue, disease tissue and tissue libraries associated with
tumor or disease; and selection of the candidates whose
component ESTs were found exclusively or more frequently in
tumor libraries. The CLASP allows the identification of highly
expressed organ and cancer specific genes. A final manual
in-depth evaluation is then performed to finalize the CSG
selection.
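
The selection steps listed above can be illustrated with a small sketch. This is not the CLASP software itself, whose implementation is not disclosed here; the data layout (EST counts per library), the `Library` record, the `min_ratio` threshold and the toy counts are all assumptions made purely for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Library:
    """Annotation for one EST library (hypothetical layout)."""
    organ: str
    is_tumor: bool
    total_ests: int


def abundance(counts, libraries, keep):
    """Fraction of ESTs assigned to one gene across the libraries accepted by `keep`."""
    names = [name for name, lib in libraries.items() if keep(lib)]
    total = sum(libraries[name].total_ests for name in names)
    hits = sum(counts.get(name, 0) for name in names)
    return hits / total if total else 0.0


def candidate_csgs(est_counts, libraries, organ, min_ratio=5.0):
    """Return genes passing the organ-specificity and tumor-enrichment filters."""
    selected = []
    for gene, counts in est_counts.items():
        in_organ = abundance(counts, libraries, lambda lib: lib.organ == organ)
        elsewhere = abundance(counts, libraries, lambda lib: lib.organ != organ)
        # Step 1: abundant in the targeted organ versus all the other organs.
        # min_ratio is an assumed cutoff, not a value taken from the text.
        if in_organ == 0 or in_organ < min_ratio * elsewhere:
            continue
        # Steps 2 and 3: keep genes whose ESTs are more frequent in tumor libraries.
        tumor = abundance(counts, libraries,
                          lambda lib: lib.organ == organ and lib.is_tumor)
        normal = abundance(counts, libraries,
                           lambda lib: lib.organ == organ and not lib.is_tumor)
        if tumor > normal:
            selected.append(gene)
    return selected


# Toy data (hypothetical): three libraries and three genes.
libraries = {
    "prostate_tumor": Library("prostate", True, 1000),
    "prostate_normal": Library("prostate", False, 1000),
    "liver_normal": Library("liver", False, 1000),
}
est_counts = {
    "geneA": {"prostate_tumor": 30, "prostate_normal": 5, "liver_normal": 1},
    "geneB": {"prostate_tumor": 2, "prostate_normal": 20, "liver_normal": 0},
    "geneC": {"prostate_tumor": 10, "prostate_normal": 8, "liver_normal": 9},
}
print(candidate_csgs(est_counts, libraries, "prostate"))
```

In this toy input only `geneA` is both enriched in the targeted organ and more frequent in the tumor library; any such automated list would still be subject to the manual evaluation described above.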
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Clones depicted in the following Table 1 are CSGs useful 
in diagnosing, monitoring, staging, imaging and treating
prostate cancer.

Table 1: CSGs

Clone ID      Pro #     SEQ ID NO:
3424528H1     Pro109    1, 2
578349H1      Pro112    3, 4
1794013H1     Pro111    5, 6
2189835H1     Pro115    7, 8
3277219H1     Pro110    9, 10
1857415       Pro113    11, 12
1810463H1     Pro114    13, 14
zr65G11       Pro118    15, 16
2626135H1     -         17
zd46d08       -         18
1712252H1     -         19
784583H1      -         20



Example 2: Relative Quantitation of Gene Expression 

Real-time quantitative PCR with fluorescent Taqman probes
is a quantitation detection system utilizing the 5'-3'
nuclease activity of Taq DNA polymerase. The method uses an
internal fluorescent oligonucleotide probe (Taqman) labeled
with a 5' reporter dye and a downstream, 3' quencher dye.
During PCR, the 5'-3' nuclease activity of Taq DNA polymerase
releases the reporter, whose fluorescence can then be detected
by the laser detector of the Model 7700 Sequence Detection
System (PE Applied Biosystems, Foster City, CA, USA).

Amplification of an endogenous control is used to
standardize the amount of sample RNA added to the reaction and
normalize for Reverse Transcriptase (RT) efficiency. Either
cyclophilin, glyceraldehyde-3-phosphate dehydrogenase (GAPDH),
ATPase, or 18S ribosomal RNA (rRNA) is used as this endogenous
control. To calculate relative quantitation between all the
samples studied, the target RNA levels for one sample were
used as the basis for comparative results (calibrator).
Quantitation relative to the "calibrator" can be obtained
using the standard curve method or the comparative method
(User Bulletin #2: ABI PRISM 7700 Sequence Detection System).
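
For illustration, the comparative method can be sketched as below. This is a minimal sketch assuming the usual 2^(-ddCt) form of the comparative method, in which the target and the endogenous control amplify with approximately equal, near-100% efficiency; the function name and the threshold-cycle (Ct) values are hypothetical and are not data from these examples.

```python
def relative_quantity(ct_target, ct_control, ct_target_cal, ct_control_cal):
    """Relative expression of a target gene versus the calibrator sample.

    ct_target, ct_control:         Ct of target and endogenous control in the sample
    ct_target_cal, ct_control_cal: the same two Ct values in the calibrator
    Assumes roughly equal amplification efficiency for both amplicons.
    """
    delta_ct_sample = ct_target - ct_control          # normalize to endogenous control
    delta_ct_calibrator = ct_target_cal - ct_control_cal
    delta_delta_ct = delta_ct_sample - delta_ct_calibrator
    return 2.0 ** (-delta_delta_ct)


# Hypothetical Ct values: the target crosses threshold 3 cycles earlier
# (after normalization) in the sample than in the calibrator.
print(relative_quantity(24.0, 18.0, 27.0, 18.0))   # sample versus calibrator
print(relative_quantity(27.0, 18.0, 27.0, 18.0))   # calibrator versus itself
```

By construction the calibrator always has a relative level of 1, which is why the calibrator tissue appears with the value 1 in the pooled-sample tables.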

The tissue distribution and the level of the target gene
were evaluated for every sample in normal and cancer tissues.
Total RNA was extracted from normal tissues, cancer tissues,
and from cancers and the corresponding matched adjacent
tissues. Subsequently, first strand cDNA was prepared with
reverse transcriptase and the polymerase chain reaction was
done using primers and Taqman probes specific to each target
gene. The results were analyzed using the ABI PRISM 7700
Sequence Detector. The absolute numbers are relative levels
of expression of the target gene in a particular tissue
compared to the calibrator tissue.

Expression of Clone ID 3424528H1 (Pro109):

For the CSG Pro109, real-time quantitative PCR was
performed using the following primers:
Forward Primer:
5'- ATCAGAACAAAGAGGCTGTGTC - 3' (SEQ ID NO: 21)
Reverse Primer:
5'- ATCTCTAAAGCCCCAACCTTC - 3' (SEQ ID NO: 22)
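
As a quick sanity check on a primer pair such as SEQ ID NO: 21 and 22, the length and GC content can be computed directly from the sequences given above. This is only an illustrative sketch; the helper name `gc_content` is an assumption, and no melting-temperature model is implied.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# Primer sequences as listed in the text (5' to 3').
primers = {
    "forward (SEQ ID NO: 21)": "ATCAGAACAAAGAGGCTGTGTC",
    "reverse (SEQ ID NO: 22)": "ATCTCTAAAGCCCCAACCTTC",
}

for name, seq in primers.items():
    print(f"{name}: {len(seq)} nt, {gc_content(seq):.1f}% GC")
```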

The absolute numbers depicted in Table 2 are relative levels
of expression of the CSG referred to as Pro109 in 12 different
normal tissues. All the values are compared to normal
stomach (calibrator). These RNA samples are commercially
available pools, obtained by pooling samples of a particular
tissue from different individuals.

Table 2: Relative Levels of CSG Pro109 Expression in Pooled
Samples

Tissue             NORMAL
Colon              0.02
Endometrium        0.01
Kidney             0.48
Liver              14.83
Ovary              0.08
Pancreas           4.38
Prostate           11.24
Small Intestine    0.42
Spleen             0
Stomach            1
Testis             0.62
Uterus             0.02



The relative levels of expression in Table 2 show that, with
the exception of liver (14.83), Pro109 mRNA expression is
higher in prostate (11.24) than in all other normal tissues
analyzed. Pancreas, with a relative expression level of 4.38,
is the only other tissue expressing considerable mRNA for
Pro109.

The absolute numbers in Table 2 were obtained analyzing
pools of samples of a particular tissue from different
individuals. They cannot be compared to the absolute numbers
obtained from RNA from tissue samples of a single individual
in Table 3.

The absolute numbers depicted in Table 3 are relative
levels of expression of Pro109 in 28 pairs of matching samples
and 4 unmatched samples. All the values are compared to
normal stomach (calibrator). A matching pair is formed by
mRNA from the cancer sample for a particular tissue and mRNA
from the normal adjacent sample for that same tissue from the
same individual.

Table 3: Relative Levels of CSG Pro109 Expression in
Individual Samples

Sample ID    Tissue             Cancer         Matching Normal Adjacent
Pro34B       Prostate 1         5.98           [illegible]
Pro65XB      Prostate 2         16.68          [illegible]
Pro69XB      Prostate 3         20.46          -
Pro78XB      Prostate 4         1.39           [illegible]
Pro101XB     Prostate 5         24.8           9.8
Pro12B       Prostate 6         9.1            0.2
Pro13XB      Prostate 7         0.5            9.7
Pro20XB      Prostate 8         13             12.5
Pro23B       Prostate 9         16.8           3
Ovr10005O    Ovary 1            0.4            -
Ovr1028      Ovary 2            1.9            -
Ovr18GA      Ovary 3            -              [illegible]
Ovr206I      Ovary 4            -              [illegible]
Mam12X       Mammary Gland 1    13.5           [illegible]
Mam47XP      Mammary Gland 2    0.7            -
Lng47XQ      Lung 1             -              -
Lng60XL      Lung 2             7.39           [illegible]
Lng75XC      Lung 3             0.77           0.27
StoAC44      Stomach 1          0.05           1.19
StoAC93      Stomach 2          0.55           0.8
StoAC99      Stomach 3          0.12           3.04
ColAS43      Colon 1            16.11          0.07
ColAS45      Colon 2            0.11           0.06
ColAS46      Colon 3            4.99           0.4
Liv15XA      Liver 1            8.43           10.97
Liv42X       Liver 2            1.57           20.82
Liv94XA      Liver 3            2.98           9.19
Pan77X       Pancreas 1         36             32
Pan82XP      Pancreas 2         0.09           7.09
Pan92X       Pancreas 3         0.7            0
Pan71XL      Pancreas 4         2.48           0.73
Pan10343     Pancreas 5         46             5.5

0 = Negative



In the analysis of matching samples, the higher levels
of expression were in prostate, showing a high degree of
tissue specificity for prostate tissue. Of all the
non-prostate samples analyzed, only 4 cancer samples (mammary
1 with 13.5, colon 1 with 16.11, liver 1 with 8.43, and lung 2
with 7.39) showed an expression comparable to the mRNA
expression in prostate. These results confirmed some degree
of tissue specificity as obtained with the panel of normal
pooled samples (Table 2).

Furthermore, the level of mRNA expression was compared
in cancer samples and the isogenic normal adjacent tissue from
the same individual. This comparison provides an indication
of specificity for the cancer (e.g., higher levels of mRNA
expression in the cancer sample compared to the normal
adjacent). Table 3 shows overexpression of Pro109 in 6 out
of 9 primary prostate cancer tissues compared with their
respective normal adjacents. Thus, overexpression in the
cancer tissue was observed in 66.66% of the prostate matching
samples tested (total of 9 prostate matching samples).

Altogether, the degree of tissue specificity, plus the
mRNA overexpression in 66.66% of the primary prostate matching
samples tested, is indicative of Pro109 being a diagnostic
marker for prostate cancer.

Expression of Clone ID 578349H1 (Pro112):

For the CSG Pro112, real-time quantitative PCR was
performed using the following primers:
Forward Primer:
5'- TGCCGAAGAGGTTCAGTGC - 3' (SEQ ID NO: 23)
Reverse Primer:
5'- GCCACAGTGGTACTGTCCAGAT - 3' (SEQ ID NO: 24)

The absolute numbers depicted in Table 4 are relative
levels of expression of the CSG Pro112 in 12 different normal
tissues. All the values are compared to normal thymus
(calibrator). These RNA samples are commercially available
pools, obtained by pooling samples of a particular tissue
from different individuals.

Table 4: Relative Levels of CSG Pro112 Expression in Pooled
Samples

Tissue             NORMAL
Brain              2.9
Heart              0.1
Kidney             0.2
Liver              0.2
Lung               7.7
Mammary            4.2
Muscle             0.1
Prostate           5.5
Small Intestine    1.8
Testis             1
Thymus             1
Uterus             21



The relative levels of expression in Table 4 show that
Pro112 mRNA expression in the pool of normal prostate tissue
(5.5) is the third highest among the 12 tissues analyzed,
after uterus (21) and lung (7.7). The absolute numbers in
Table 4 were obtained analyzing pools of samples of a
particular tissue from different individuals. These results
demonstrate that Pro112 mRNA expression is specific for
prostate, thus indicating Pro112 to be a diagnostic marker
for prostate disease, especially cancer.
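
The ranking of tissues in Table 4 can be reproduced with a short script. This is a minimal sketch using the values listed in Table 4; the variable names are illustrative only.

```python
# Relative expression levels from Table 4 (calibrator: normal thymus = 1).
table4 = {
    "Brain": 2.9, "Heart": 0.1, "Kidney": 0.2, "Liver": 0.2,
    "Lung": 7.7, "Mammary": 4.2, "Muscle": 0.1, "Prostate": 5.5,
    "Small Intestine": 1.8, "Testis": 1, "Thymus": 1, "Uterus": 21,
}

# Sort tissues from highest to lowest relative expression.
ranked = sorted(table4, key=table4.get, reverse=True)
prostate_rank = ranked.index("Prostate") + 1

print("Top three tissues:", ranked[:3])
print("Prostate rank:", prostate_rank, "of", len(ranked))
```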

Expression of Clone ID 1794013H1 (Pro111):

For the CSG Pro111, real-time quantitative PCR was
performed using the following primers:
Forward Primer:
5'- GCTGCAAGTTCTCCACATTGA - 3' (SEQ ID NO: 25)
Reverse Primer:
5'- CAGCCGCAGGTGAAACAC - 3' (SEQ ID NO: 26)

The absolute numbers depicted in Table 5 are relative levels
of expression of the CSG Pro111 in 12 different normal
tissues. All the values are compared to normal testis
(calibrator). These RNA samples are commercially available
pools, obtained by pooling samples of a particular tissue
from different individuals.

Table 5: Relative Levels of CSG Pro111 Expression in Pooled
Samples

Tissue             NORMAL
Brain              0.04
Heart              0
Kidney             0
Liver              0
Lung               0.05
Mammary            0.14
Muscle             5166.6
Prostate           1483.72
Small Intestine    0.33
Testis             1
Thymus             0.49
Uterus             0.07



The relative levels of expression in Table 5 show that Pro111
mRNA expression is extraordinarily high in the pool of normal
prostate (1483.72) compared to all the other tissues analyzed
with the exception of muscle (5166.6). These results
demonstrate that Pro111 mRNA expression shows specificity for
prostate and muscle.

The absolute numbers in Table 5 were obtained analyzing
pools of samples of a particular tissue from different
individuals. They cannot be compared to the absolute numbers
obtained from RNA from tissue samples of a single individual
in Table 6.

The absolute numbers depicted in Table 6 are relative
levels of expression of Pro111 in 48 pairs of matching and 18
unmatched samples. All the values are compared to normal
testis (calibrator). A matching pair is formed by mRNA from
the cancer sample for a particular tissue and mRNA from the
normal adjacent sample for that same tissue from the same
individual.

Table 6: Relative Levels of CSG Pro111 Expression in
Individual Samples

Sample ID    Tissue         Cancer     Matching Normal Adjacent
Pro101XB     Prostate 1     8.3        21.8
Pro12B       Prostate 2     2336       133
Pro13XB      Prostate 3     3.4        23
Pro20XB      Prostate 4     21.6       121.5
Pro23B       Prostate 5     19.4       3.7
Pro34B       Prostate 6     15         39
Pro65XB      Prostate 7     6          867
Pro69XB      Prostate 8     56         94
Pro78XB      Prostate 9     24         1515
Pro84XB      Prostate 10    119        15.35
Pro90XB      Prostate 11    8.08       112.2
Pro91XB      Prostate 12    0.88       51.8
ProC215      Prostate 13    0.3        -
ProC234      Prostate 14    0.35       -
ProC280      Prostate 15    436.5      -
Pro109XB     Prostate 16    3.43       265
Pro110       Prostate 17    18.2       8.73



WO 00/23111 



PCT/US99/24331 






Table 6 (continued)

Sample ID      Tissue                       Cancer         Matching Normal Adjacent
Pro125XB       Prostate 18                  0.34           186
Pro326         Prostate 19                  1392           110
Pro10R         Prostate 20 (prostatitis)    0.5            -
Pro20R         Prostate 21 (prostatitis)    24.1           -
[illegible]    Prostate 22 (BPH)            4610           -
[illegible]    Prostate 23 (BPH)            0              -
[illegible]    Prostate 24 (BPH)            1.46           -
[illegible]    Prostate 25 (BPH)            0              -
[illegible]    Prostate 26 (BPH)            1.47           -
[illegible]    Prostate 27 (BPH)            14.4           -
[illegible]    Testis 1                     0              0
[illegible]    Bladder 1                    0.44           0.41
[illegible]    Bladder 2                    0              0
[illegible]    Bladder 3                    0              0
[illegible]    Bladder 4                    0              0
[illegible]    Kidney 1                     0              0
[illegible]    Kidney 2                     0              0
[illegible]    Kidney 3                     0              0
Pan10343       Pancreas 1                   0              0
Pan71XL        Pancreas 2                   0              0
Pan77X         Pancreas 3                   0              0
Liv15XA        Liver 1                      0              0
[illegible]    Liver 2                      0              0
ClnAS43        Colon 1                      0              0
ClnAS45        Colon 2                      0              0
ClnAS46        Colon 3                      0              0
ClnAS67        Colon 4                      0              0
ClnAC19        Colon 5                      0              0
ClnAS12        Colon 6                      0              0
[illegible]    Small Intestine 1            0              0
[illegible]    Small Intestine 2            0              0
[illegible]    Lung 1                       0.7            0
[illegible]    Lung 2                       0              0
[illegible]    Lung 3                       0              0
[illegible]    Lung 4                       0              0
[illegible]    Mammary Gland 1              0              1.4
[illegible]    Mammary Gland 2              0.2            0
[illegible]    Mammary Gland 3              0              0
[illegible]    Mammary Gland 4              0              0
[illegible]    Mammary Gland 5              0              0
[illegible]    Mammary Gland 6              0              0
Ovr103X        Ovary 1                      0.14           0
[illegible]    Ovary 2                      [illegible]    -
Ovr1028        Ovary 3                      0              -
Ovr1040O       Ovary 4                      0.2            -
Ovr18GA        Ovary 5                      -              0
Ovr206I        Ovary 6                      -              0
Ovr20GA        Ovary 7                      -              0.2
Ovr25GA        Ovary 8                      -              0

0 = Negative



In the analysis of matching samples, the higher levels
of expression were in prostate, showing a high degree of tissue
specificity for prostate. These results confirm the tissue
specificity results obtained with normal pooled samples (Table
5).

Furthermore, the levels of mRNA expression in cancer
samples and the isogenic normal adjacent tissue from the same
individual were compared. This comparison provides an
indication of specificity for cancer (e.g., higher levels of
mRNA expression in the cancer sample compared to the normal
adjacent). Table 6 shows overexpression of Pro111 in 5 out
of 16 primary prostate cancer samples compared with their
respective normal adjacent (prostate samples 2, 5, 10, 17, and
19). Similar expression levels were observed in 3 unmatched
prostate cancers (prostate samples 13, 14, 15), 2 prostatitis
(prostate samples 20, 21), and 6 benign prostatic hyperplasia
samples (prostate samples 22 through 27). Thus, there is
overexpression in the cancer tissue of 31.25% of the prostate
matching samples tested (total of 16 prostate matching
samples).

Altogether, the high level of tissue specificity, plus
the mRNA overexpression in 31.25% of the prostate matching
samples tested, are indicative of Pro111 being a diagnostic
marker for prostate cancer.

Expression of Clone ID 2189835H1 (Pro115):

For the CSG Pro115, real-time quantitative PCR was
performed using the following primers:
Forward Primer:
5'- TGGCTTTGAACTCAGGGTCA - 3' (SEQ ID NO: 27)
Reverse Primer:
5'- CGGATGCACCTCGTAGACAG - 3' (SEQ ID NO: 28)

The absolute numbers depicted in Table 7 are relative levels
of expression of the CSG Pro115 in 12 different normal
tissues. All the values are compared to normal thymus
(calibrator). These RNA samples are commercially available
pools, obtained by pooling samples of a particular tissue
from different individuals.

Table 7: Relative Levels of CSG Pro115 Expression in Pooled
Samples

Tissue             NORMAL
Brain              0.016
Heart              0.002
Kidney             8.08
Liver              2.20
Lung               112.99
Mammary            29.45
Muscle             0.05
Prostate           337.79
Small Intestine    7.54
Testis             1.48
Thymus             1
Uterus             1.4



The relative levels of expression in Table 7 show that
Pro115 mRNA expression is higher (337.79) in prostate compared
with all the other normal tissues analyzed. Lung, with a
relative expression level of 112.99, and mammary (29.446) are
the other tissues expressing moderate levels of mRNA for
Pro115. These results establish Pro115 mRNA expression to be
highly specific for prostate.

The absolute numbers in Table 7 were obtained analyzing
pools of samples of a particular tissue from different
individuals. They cannot be compared to the absolute numbers
obtained from RNA from tissue samples of a single individual
in Table 8.

The absolute numbers depicted in Table 8 are relative
levels of expression of Pro115 in 17 pairs of matching and 21
unmatched samples. All the values are compared to normal
thymus (calibrator). A matching pair is formed by mRNA from
the cancer sample for a particular tissue and mRNA from the
normal adjacent sample for that same tissue from the same
individual.

Table 8: Relative Levels of CSG Pro115 Expression in
Individual Samples

Sample ID    Tissue                       Cancer         Matching Normal Adjacent
Pro12B       Prostate 1                   1475.9         190.3
ProC234      Prostate 2                   169.61         -
Pro109XB     Prostate 3                   -              639.53
Pro101XB     Prostate 4                   1985.2         2882.9
Pro13XB      Prostate 5                   34.9           13.9
Pro215       Prostate 6                   525.59         -
Pro125XB     Prostate 7                   -              556.05
Pro23B       Prostate 8                   1891.4         1118.6
ProC280      Prostate 9                   454.3          -
Pro20XB      Prostate 10                  -              -
Pro34B       Prostate 11                  -              [illegible]
Pro65XB      Prostate 12                  -              [illegible]
Pro69XB      Prostate 13                  -              -
Pro10R       Prostate 14 (prostatitis)    [illegible]    -
Pro20R       Prostate 15 (prostatitis)    397.79         -
Pro258       Prostate 16 (BPH)            216.6          -
Pro263C      Prostate 17 (BPH)            601.25         -
Pro267A      Prostate 18 (BPH)            200.28         -
Pro271A      Prostate 19 (BPH)            111.43         -
Pro460Z      Prostate 20 (BPH)            53.84          -
ProC032      Prostate 21 (BPH)            56.94          -
SmI21XA      Small Intestine 1            28.8           29.9
SmIH89       Small Intestine 2            70.8           348.5
ClnAC19      Colon 1                      22.73          446.47
ClnAS12      Colon 2                      116.97         493.18
Kid106XD     Kidney 1                     86.13          41.14
Kid107XD     Kidney 2                     0.26           35.14
Lng47XQ      Lung 1                       5.13           20.98
Lng60XL      Lung 2                       13.93          114.78
Lng75XC      Lung 3                       16.47          53.79
Mam12X       Mammary Gland 1              6.25           10.75
Mam162X      Mammary Gland 2              1.84           2.54
Mam42DN      Mammary Gland 3              23.08          35.51
Ovr1005O     Ovary 1                      0.9            -
Ovr1028      Ovary 2                      261.4          -
Ovr103X      Ovary 3                      7              0.1
Ovr20GA      Ovary 4                      -              0
Ovr25GA      Ovary 5                      -              0

0 = Negative



Higher levels of expression were seen in prostate,
showing a high degree of tissue specificity for prostate
tissue. Of all the non-prostate samples analyzed, only two
cancer samples (colon 2 with 116.97 and ovary 2 with 261.4)
and 5 normal adjacent tissue samples (small intestine 2,
colon 1, colon 2, kidney 1, and lung 2) showed an expression
comparable to the mRNA expression in prostate. These results
confirmed the tissue specificity results obtained with the
panel of normal pooled samples (Table 7).

Furthermore, the levels of mRNA expression in cancer
samples and the isogenic normal adjacent tissue from the same
individual were compared. This comparison provides an
indication of specificity for the cancer (e.g., higher levels
of mRNA expression in the cancer sample compared to the normal
adjacent). Table 8 shows higher expression of Pro115 in 3 out
of 4 matched prostate cancer tissues (prostate samples 1, 5
and 8).

Altogether, the high level of tissue specificity, plus
the higher expression in 75% of the prostate matching samples
tested, are indicative of Pro115 being a diagnostic marker for
prostate cancer.

Expression of Clone ID 3277219H1 (Pro110):

For the CSG Pro110, real-time quantitative PCR was
performed using the following primers:
Forward Primer:
5'- CGGCAACCTGGTAGTGAGTG - 3' (SEQ ID NO: 29)
Reverse Primer:
5'- CGCAGCTCCTTGTAAACTTCAG - 3' (SEQ ID NO: 30)

The absolute numbers depicted in Table 9 are relative levels
of expression of the CSG Pro110 in 12 different normal
tissues. All the values are compared to normal small
intestine (calibrator). These RNA samples are commercially
available pools, obtained by pooling samples of a particular
tissue from different individuals.

Table 9: Relative Levels of CSG Pro110 Expression in Pooled
Samples

Tissue             NORMAL
Brain              6.61
Heart              0.7
Kidney             0.74
Liver              7.94
Lung               11.88
Mammary            22.78
Muscle             6.77
Prostate           3.01
Small Intestine    1
Testis             2.58
Thymus             13.74
Uterus             2.61



The relative levels of expression in Table 9 show that Pro110
mRNA expression is not high in normal prostate (3.01)
relative to the other normal tissues analyzed.

The absolute numbers in Table 9 were obtained analyzing
pools of samples of a particular tissue from different
individuals. They cannot be compared to the absolute numbers
obtained from RNA from tissue samples of a single individual
in Table 10.

The absolute numbers depicted in Table 10 are relative
levels of expression of Pro110 in 33 pairs of matching
samples. All the values are compared to normal small
intestine (calibrator). A matching pair is formed by mRNA
from the cancer sample for a particular tissue and mRNA from
the normal adjacent sample for that same tissue from the same
individual.

Table 10: Relative Levels of CSG Pro110 Expression in
Individual Samples

Sample ID      Tissue               Cancer         Matching Normal Adjacent
Pro12B         Prostate 1           11.8           0.3
Pro78XB        Prostate 2           14.3           6.3
Pro101XB       Prostate 3           33.2           10.7
Pro13XB        Prostate 4           0.3            0.4
Pro23XB        Prostate 5           25.5           14.4
Pro20XB        Prostate 6           43.3           4
Pro34XB        Prostate 7           31.8           18.7
Pro65XB        Prostate 8           26.9           3.4
Pro69XB        Prostate 9           12.5           7
Lng75XC        Lung 1               1.9            3
Lng90X         Lung 2               5.5            0.5
LngAC11        Lung 3               9.3            9.7
LngAC32        Lung 4               11.2           2.2
Lng47XQ        Lung 5               11.3           0.3
Lng60XL        Lung 6               29.1           6.8
Mam12B         Mammary Gland 1      19.8           0
Mam603X        Mammary Gland 2      13.7           0
Mam82XI        Mammary Gland 3      73.5           0
MamA04         Mammary Gland 4      0              24.6
MamB011X       Mammary Gland 5      17.4           2
MamC012        Mammary Gland 6      0              12.8
MamC034        Mammary Gland 7      0              61
Mam12X         Mammary Gland 8      14             2.2
Mam59X         Mammary Gland 9      33             2.2
[illegible]    Mammary Gland 10     [illegible]    [illegible]
Liv15XA        Liver 1              -              [illegible]
Liv42X         Liver 2              [illegible]    -
Liv94XA        Liver 3              0.4            1.4
ClnAS43        Colon 1              52.9           1.4
ClnAS45        Colon 2              2.1            0.8
ClnAS46        Colon 3              39.8           3.7
SmI21X         Small Intestine 1    0.9            0.1
SmIH89         Small Intestine 2    5.8            0.9

0 = Negative

The levels of mRNA expression in cancer samples and the
isogenic normal adjacent tissue from the same individual were
compared. This comparison provides an indication of
specificity for the cancer (e.g., higher levels of mRNA
expression in the cancer sample compared to the normal
adjacent). Table 10 shows overexpression of Pro110 in 8 of
the 9 primary prostate cancer tissues compared with their
respective normal adjacent (except prostate 4). Thus, there
was overexpression in the cancer tissue of 88.88% of the
prostate matching samples tested (total of 9 prostate
matching samples).
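
The pair-wise comparison can be checked directly against the prostate rows of Table 10. The sketch below simply counts the pairs in which the cancer value exceeds the matching normal adjacent value; the variable names are illustrative, and no statistical test is implied. Note that 8 of 9 rounds to 88.89%, which the text reports truncated as 88.88%.

```python
# (cancer, matching normal adjacent) relative levels for the
# nine prostate pairs listed in Table 10.
prostate_pairs = {
    "Prostate 1": (11.8, 0.3),
    "Prostate 2": (14.3, 6.3),
    "Prostate 3": (33.2, 10.7),
    "Prostate 4": (0.3, 0.4),
    "Prostate 5": (25.5, 14.4),
    "Prostate 6": (43.3, 4.0),
    "Prostate 7": (31.8, 18.7),
    "Prostate 8": (26.9, 3.4),
    "Prostate 9": (12.5, 7.0),
}

# A pair counts as overexpressed when the cancer sample is higher
# than its normal adjacent sample from the same individual.
overexpressed = [name for name, (cancer, normal) in prostate_pairs.items()
                 if cancer > normal]
fraction = len(overexpressed) / len(prostate_pairs)

print(f"{len(overexpressed)} of {len(prostate_pairs)} pairs ({fraction:.2%})")
```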

Although not tissue specific, Pro110 mRNA expression is
upregulated in prostate cancer tissues. The mRNA
overexpression in 88.88% of the primary prostate matching
cancer samples tested is indicative of Pro110 being a
diagnostic marker for prostate cancer. Pro110 also showed
overexpression in several other cancers tested including small
intestine, colon, liver, mammary and lung (see Table 10).
Accordingly, Pro110 may be a diagnostic marker for other types
of cancer as well.

Expression of Clone ID 1857415; Gene ID 346880 (Pro113):

For the CSG Pro113, real-time quantitative PCR was
performed using the following primers:
Forward Primer:
5'- CGGGAACCTACCAGCCTATG - 3' (SEQ ID NO: 31)
Reverse Primer:
5'- CAGGCAACAGGGAGTCATGT - 3' (SEQ ID NO: 32)

The absolute numbers depicted in Table 11 are relative levels
of expression of the CSG Pro113 in 12 different normal
tissues. All the values are compared to normal thymus
(calibrator). These RNA samples are commercially available
pools, obtained by pooling samples of a particular tissue
from different individuals.

Table 11: Relative Levels of CSG Pro113 Expression in
Pooled Samples

Tissue             NORMAL
Brain              0.03
Heart              0
Kidney             0.01
Liver              0
Lung               0
Mammary Gland      0
Muscle             0.04
Prostate           489.44
Small Intestine    0.02
Testis             0.35
Thymus             1
Uterus             0.13



The relative levels of expression in Table 11 show that Pro113
mRNA expression is higher (489.44) in prostate compared with
all the other normal tissues analyzed. Testis, with a
relative expression level of 0.35, uterus (0.13), thymus
(1.0), kidney (0.01) and brain (0.03) were among the other
tissues expressing lower mRNA levels for Pro113. These
results establish that Pro113 mRNA expression is highly
specific for prostate.
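
One simple way to put a number on this specificity is the ratio between the highest tissue and the next-highest tissue in Table 11. The sketch below uses the Table 11 values; the choice of this ratio as a specificity measure is an assumption made for illustration, not a metric defined in the text.

```python
# Relative expression levels from Table 11 (calibrator: normal thymus = 1).
table11 = {
    "Brain": 0.03, "Heart": 0, "Kidney": 0.01, "Liver": 0, "Lung": 0,
    "Mammary Gland": 0, "Muscle": 0.04, "Prostate": 489.44,
    "Small Intestine": 0.02, "Testis": 0.35, "Thymus": 1, "Uterus": 0.13,
}

# Order tissues by relative level and compare the top two.
ranked = sorted(table11.items(), key=lambda item: item[1], reverse=True)
(top_tissue, top_level), (next_tissue, next_level) = ranked[0], ranked[1]
fold_over_next = top_level / next_level

print(f"{top_tissue} is {fold_over_next:.2f}-fold above {next_tissue}")
```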

The absolute numbers in Table 11 were obtained analyzing
pools of samples of a particular tissue from different
individuals. They cannot be compared to the absolute numbers
obtained from RNA from tissue samples of a single individual
in Table 12.

The absolute numbers depicted in Table 12 are relative
levels of expression of Pro113 in 78 pairs of matching and 25
unmatched tissue samples. All the values are compared to
normal thymus (calibrator). A matching pair is formed by mRNA
from the cancer sample for a particular tissue and mRNA from
the normal adjacent sample for that same tissue from the same
individual. In cancers (for example, ovary) where it was not
possible to obtain normal adjacent samples from the same
individual, samples from a different normal individual were
analyzed.

Table 12: Relative Levels of CSG Pro113 Expression in
Individual Samples

Sample ID              Tissue                       Cancer         Matched or Unmatched Normal Adjacent
Pro780B/781B           Prostate 1                   375.53         446.29
Pro1291B/1292B         Prostate 2                   1060           31
Pro139B96/140B96       Prostate 3                   41             32
Pro209B96/210B96       Prostate 4                   505            255
Pro1256B/1257B         Prostate 5                   165.79         141.63
Pro1293B/1294B         Prostate 6                   1613.7         874.61
Pro694B/695B           Prostate 7                   458.6          142.21
Pro1012B/1013B         Prostate 8                   1520           864
Pro1222B/1223B         Prostate 9                   939            530
Pro845B/846B           Prostate 10                  1552.4         374.6
Pro1094B/1095B         Prostate 11                  278.37         135.89
Pro650B/651B           Prostate 12                  532.81         640.85
Pro902B/903B           Prostate 13                  [illegible]    [illegible]
Pro916B/917B           Prostate 14                  699.42         401.24
Pro9821110A/110B       Prostate 15                  156            487.8
ProS9821326A/26B       Prostate 16                  744.4          472.8
Pro9407c215            Prostate 17                  1389.2         -
Pro9407c234            Prostate 18                  305.5          -
Pro9407c280A           Prostate 19                  894.5          -
Pro9409C010R           Prostate 20 (prostatitis)    269.7          -
Pro9404C120R           Prostate 21 (prostatitis)    299.2          -
Pro1000258             Prostate 22 (BPH)            149.6          -
Pro4001263C            Prostate 23 (BPH)            576            -
Pro4001267A            Prostate 24 (BPH)            132.1          -
Pro9411C032            Prostate 25 (BPH)            118.2          -
Pro40014602            Prostate 26 (BPH)            276.3          -
Pro4001271A            Prostate 27 (BPH)            58.7           -
Kid1064D/65D           Kidney 1                     [illegible]    [illegible]
[illegible]            Kidney 2                     [illegible]    0.02
[illegible]            Kidney 3                     [illegible]    [illegible]
[illegible]            Kidney 4                     -              0
Kid1183D/1184D         Kidney 5                     [illegible]    [illegible]
Kid1242D/1243D         Kidney 6                     0              0
Bld469K                Bladder 1                    -              2.88
Bld467K/468K           Bladder 2                    2.65           -
Bld327K/328K           Bladder 3                    0              4.05
Bld470K                Bladder 4                    -              1.64
Bld665T/664T           Bladder 5                    0.21           1.99
[illegible]            Bladder 6                    [illegible]    -
[illegible]            Bladder 7                    -              [illegible]
[illegible]            Testis 1                     [illegible]    -
[illegible]            Testis 2                     [illegible]    [illegible]
TstS9820663A/663E      Testis 3                     72             1.4
SknS9821248A/248B      Skin 1                       1.8            0.5
SknS99448A/448B        Skin 2                       251.6          0
Skn99816A/816B         Skin 3                       33             0.7
Sto4004864A4/B4        Stomach 1                    14.12          0
Sto4004509A3/B1        Stomach 2                    40.74          39
SmI9807A212A/213A      Small Intestine 1            0.1            0
SmI9802H008/H009       Small Intestine 2            5.8            0.1
[illegible]            Colon 1                      -              [illegible]
[illegible]            Colon 2                      [illegible]    [illegible]
[illegible]            Colon 3                      [illegible]    [illegible]
[illegible]            Colon 4                      [illegible]    [illegible]
[illegible]            Colon 5                      [illegible]    0.96
[illegible]            Colon 6                      17.9           20.64
[illegible]            Colon 7                      [illegible]    0.3
[illegible]            Colon 8                      21.39          0
ClnCXGA                Colon 9                      429.14         142.69
Pan10343a              Pancreas 1                   0              0
[illegible]            Pancreas 2                   0              0.15
Pan9210/9220           Pancreas 3                   7.36           0
Pan714L/715L           Pancreas 4                   13.57          0.11
Pan824P/825P           Pancreas 5                   0              0
Lng476Q/477Q           Lung 1                       0              0
Lng605L/606L           Lung 2                       0              0.1
Lng11145B/11145C       Lung 3                       85.9           0
[illegible]            Lung 4                       [illegible]    [illegible]
[illegible]            Lung 5                       [illegible]    [illegible]
[illegible]            Lung 6                       -              0
[illegible]            Lung 7                       [illegible]    0
[illegible]            Lung 8                       26.17          0
[illegible]            Lung 9                       0.68           0
[illegible]            Lung 10                      0              0
[illegible]            Mammary Gland 1              8.5            0
[illegible]            Mammary Gland 2              61.07          0
[illegible]            Mammary Gland 3              4.84           0
[illegible]            Mammary Gland 4              9.72           6.99
[illegible]            Mammary Gland 5              [illegible]    0
[illegible]            Mammary Gland 6              2.45           0
[illegible]            Endometrium 1                [illegible]    1.12
[illegible]            Endometrium 2                [illegible]    0.39
[illegible]            Endometrium 3                23.5           1.56
[illegible]            Endometrium 4                [illegible]    79.02
[illegible]            Uterus 1                     0.2            0
[illegible]            Uterus 2                     0              0
[illegible]            Uterus 3                     14             0.4
[illegible]            Uterus 4                     8.65           4.64
[illegible]            Cervix 1                     0.82           77.15
[illegible]            Cervix 2                     0.78           221.48
CvxND00023D01/23N01    Cervix 3                     [illegible]    13.22
Ovr1037O/1038O         Ovary 1                      0.1            0
Ovr1005O               Ovary 2                      18.96          -
Ovr1028                Ovary 3                      0              -
Ovr14638A1C            Ovary 4                      3.2            -
Ovr14603A1D            Ovary 5                      882.3          -
Ovr7730                Ovary 6                      0              -
Ovr9702C018GA          Ovary 7                      -              0.15
Ovr206I                Ovary 8                      -              0
Ovr9702C020GA          Ovary 9                      -              0
Ovr9702C025GA          Ovary 10                     -              0
Ovr9701C035GA          Ovary 11                     -              0.07
Ovr9701C050GB          Ovary 12                     -              0.58

0 = Negative



In the analysis of matching samples, the higher levels
of expression were in prostate, showing a high degree of
tissue specificity for prostate tissue. In addition to the
higher expression levels in prostate cancer samples, Pro113
expression was found to be either induced (where not expressed
in normal adjacent tissues) or somewhat upregulated in several
other cancers. However, the relative expression and the fold
increase in prostate cancer samples far exceed those in other
cancer tissues and are highly significant.

Furthermore, the levels of mRNA expression in cancer
samples and the isogenic normal adjacent tissue from the same
individual were compared. This comparison provides an
indication of specificity for the cancer (e.g. higher levels
of mRNA expression in the cancer sample compared to the normal
adjacent). Table 12 shows overexpression of Pro113 in 13 out
of 16 primary prostate cancer tissues compared with their
respective normal adjacent (prostate samples 2, 3, 4, 5, 6, 7,
8, 9, 10, 11, 13, 14, 16). Thus, there was overexpression in
the cancer tissue for 81.25% of the prostate matching samples
tested. The median for the level of expression in prostate
cancer tissue samples is 609, whereas the median for all other
cancers is only 7.93, with the exception of one colon sample,
colon 9, whose expression was similar to that found in
prostate cancer tissues.
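
The percentage and the gap between the two medians quoted above can be checked with a few lines of arithmetic. The sketch below uses only figures stated in the text; the variable names are illustrative and not part of the original disclosure.

```python
# Prostate sample numbers the text lists as overexpressing Pro113
# in the cancer tissue relative to the matching normal adjacent tissue.
overexpressing = [2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 13, 14, 16]
total_matched_pairs = 16

percent = 100 * len(overexpressing) / total_matched_pairs

# Medians quoted in the text (relative expression units).
median_prostate_cancer = 609
median_other_cancers = 7.93
median_ratio = median_prostate_cancer / median_other_cancers

print(f"{len(overexpressing)}/{total_matched_pairs} = {percent:.2f}%")
print(f"prostate median / other-cancer median = {median_ratio:.1f}")
```

On these figures the prostate cancer median is roughly 77 times the median reported for the other cancers.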

Altogether, the high level of tissue specificity, plus
the mRNA overexpression in 81.25% of the primary prostate
matching samples tested, is indicative of Pro113 being a
diagnostic marker for prostate cancer. Expression was also
found to be higher in other cancer tissues compared with their
respective normal adjacent tissues (kidney, bladder, testis,
skin, stomach, small intestine, colon, pancreas, lung,
mammary, endometrium, uterus, and ovary), thus indicating
Pro113 to be a pan-cancer marker.

Expression of Clone ID 1810463H1 (Pro114):

For the CSG Pro114, real-time quantitative PCR was
performed using the following primers:
Forward Primer

5'- TGGGCATCTGGGTGTCAA - 3' (SEQ ID NO: 33)
Reverse Primer

5'- CGGCTGCGATGAGGAAGTA - 3' (SEQ ID NO: 34)

The absolute numbers depicted in Table 13 are relative
levels of expression of the CSG Pro114 in 12 different normal
tissues. All the values are compared to normal muscle
(calibrator). These RNA samples are commercially available
pools, originated by pooling samples of a particular tissue
from different individuals.
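
As a quick sanity check on the two primers listed above, their length, GC content and an approximate melting temperature can be computed. This is only a sketch: the Wallace rule (2 °C per A/T, 4 °C per G/C) is a rule of thumb for short oligonucleotides and is an assumption here, not a method described in the text; the function names are illustrative.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the primer."""
    seq = seq.upper()
    return 100 * (seq.count("G") + seq.count("C")) / len(seq)


def wallace_tm(seq: str) -> int:
    """Rule-of-thumb melting temperature in deg C: 2*(A+T) + 4*(G+C)."""
    seq = seq.upper()
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    return 2 * at + 4 * gc


# Primer sequences as given in the text for Pro114.
primers = {
    "forward (SEQ ID NO: 33)": "TGGGCATCTGGGTGTCAA",
    "reverse (SEQ ID NO: 34)": "CGGCTGCGATGAGGAAGTA",
}

for name, seq in primers.items():
    print(f"{name}: {len(seq)} nt, GC {gc_percent(seq):.1f}%, "
          f"Tm ~{wallace_tm(seq)} C")
```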
Table 13: Relative Levels of CSG Pro114 Expression in
Pooled Samples

Tissue              NORMAL
Brain               9.7
Heart               0.7
Kidney              414.4
Liver               4
Lung                882.2
Mammary             44
Muscle              1
Prostate            1951
Small Intestine     22
Testis              367.1
Thymus              25.8
Uterus              139.6


The relative levels of expression in Table 13 show that Pro114
mRNA expression is higher (1951) in prostate compared with all
the other normal tissues analyzed. Lung, with a relative
expression level of 882.2, kidney 414.4, testis 367.1 and
uterus 139.6, are the other tissues expressing higher levels
of mRNA for Pro114. These results establish Pro114 mRNA
expression to be more specific for prostate than other tissues
examined.

The high level of tissue specificity is indicative of
Pro114 being a diagnostic marker for diseases of the prostate,
especially cancer.
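
The ranking described above can be reproduced from the five values quoted in the text. The sketch below is illustrative only; the ratio of the top tissue to the runner-up is one simple way to express tissue specificity and is not a measure defined in the original.

```python
# Relative expression of Pro114 (calibrator: normal muscle = 1),
# restricted to the five tissues named in the paragraph above.
levels = {
    "Prostate": 1951,
    "Lung": 882.2,
    "Kidney": 414.4,
    "Testis": 367.1,
    "Uterus": 139.6,
}

# Sort tissues from highest to lowest relative expression.
ranked = sorted(levels.items(), key=lambda item: item[1], reverse=True)

top_tissue, top_level = ranked[0]
runner_up, runner_up_level = ranked[1]
ratio_to_runner_up = top_level / runner_up_level

for tissue, level in ranked:
    print(f"{tissue:<10}{level}")
print(f"{top_tissue} is {ratio_to_runner_up:.2f}x {runner_up}")
```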

Expression of Clone ID zr65g11 (Pro118):

For the CSG Pro118, real-time quantitative PCR was
performed using the following primers:
Forward Primer

5'- GCCCATCTCCTGCTTCTTTAGT - 3' (SEQ ID NO: 35)

Reverse Primer

5'- CGTGGAGATGGCTCTGATGTA - 3' (SEQ ID NO: 36)

The absolute numbers depicted in Table 14 are relative
levels of expression of the CSG Pro118 in 12 different normal
tissues. All the values are compared to normal kidney
(calibrator). These RNA samples are commercially available
pools, originated by pooling samples of a particular tissue
from different individuals.
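
This passage does not spell out how a calibrator-relative value is derived from the raw PCR data. One widely used approach is the comparative Ct method, in which the target is normalized to an endogenous reference gene and then to the calibrator tissue. The sketch below assumes that method and 100% amplification efficiency; the function name and all Ct values are hypothetical.

```python
def relative_expression(ct_target_sample: float,
                        ct_reference_sample: float,
                        ct_target_calibrator: float,
                        ct_reference_calibrator: float) -> float:
    """Comparative Ct estimate: 2 ** -(delta Ct sample - delta Ct calibrator)."""
    delta_sample = ct_target_sample - ct_reference_sample
    delta_calibrator = ct_target_calibrator - ct_reference_calibrator
    return 2 ** -(delta_sample - delta_calibrator)


# Hypothetical Ct values: the calibrator compared with itself gives 1,
# matching the value of 1 shown for the calibrator tissue in the tables.
print(relative_expression(28.0, 15.0, 28.0, 15.0))

# A sample whose target crosses threshold 3 cycles earlier than the
# calibrator (same reference Ct) is estimated at 2**3 times the calibrator.
print(relative_expression(25.0, 15.0, 28.0, 15.0))
```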

Table 14: Relative Levels of CSG Pro118 Expression in
Pooled Samples

Tissue              NORMAL
Colon               0.87
Endometrium         19282
Kidney              1
Liver               0
Ovary               86.22
Pancreas            0
Prostate            962.1
Small Intestine     0
Spleen              0.75
Stomach             0.54
Testis              343.7
Uterus              1064

The relative levels of expression in Table 14 show that
Pro118 mRNA expression is the third highest in prostate (962.1),
next to endometrium (19282) and uterus (1064), which are
female-specific tissues. Testis, with a relative expression
level of 343.7, is the only other male tissue expressing
moderate levels of mRNA for Pro118. These results establish
Pro118 mRNA expression to be highly specific for reproductive
tissues including the prostate.

The absolute numbers in Table 14 were obtained analyzing
pools of samples of a particular tissue from different
individuals. They cannot be compared to the absolute numbers
originated from RNA obtained from tissue samples of a single
individual in Table 15.

The absolute numbers depicted in Table 15 are relative
levels of expression of Pro118 in 59 pairs of matching and 21
unmatched samples. All the values are compared to normal
kidney (calibrator). A matching pair is formed by mRNA from
the cancer sample for a particular tissue and mRNA from the
normal adjacent sample for that same tissue from the same
individual.



Table 15: Relative Levels of CSG Pro118 Expression in
Individual Samples

Sample ID     Tissue          Cancer        Matching Normal Adjacent
Pro12B        Prostate 1      41700.7       22242.83
ProC234       Prostate 2      40087
Pro78XB       Prostate 3      4075.6        7066.7
Pro109XB      Prostate 4      334.4         777.2
Pro84XB       Prostate 5      11684         58290
Pro101XB      Prostate 6      21474.13      100720.8
Pro91X        Prostate 7      14849         33717
Pro13XB       Prostate 8      202.57        146.91
Pro65XB       Prostate 17     10126         11270
Pro69XB       Prostate 18
Pro326        (prostatitis)

In the analysis of matching samples, the higher levels of
expression were in prostate, endometrium, testis, and ovary,
showing a high degree of tissue specificity for reproductive
tissues. These results confirmed the tissue specificity
results obtained with the panel of normal pooled samples
(Table 14).

Furthermore, the levels of mRNA expression in cancer
samples and the isogenic normal adjacent tissue from the same
individual were compared. This comparison provides an
indication of specificity for the cancer (e.g. higher levels
of mRNA expression in the cancer sample compared to the normal
adjacent). Table 15 shows overexpression of Pro118 in 5 out
of 14 primary prostate cancer tissues (prostate samples 1, 8,
10, 11, 15) compared with their respective normal adjacent.
Thus, there was overexpression in the cancer tissue for 35.71%
of the prostate matching samples tested (total of 14 prostate
matching samples). Expression of Pro118 was similarly higher
in 3 unmatched cancer tissues (prostate samples 9, 13, 14),
2 prostatitis (prostate samples 20, 21), and 6 benign
hyperplasia tissues (prostate samples 22 through 27).
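
The matched-pair comparison described above amounts to asking, for each individual, whether the cancer value exceeds the normal adjacent value. The sketch below applies that test to the prostate pairs of Table 15 that are legible in this section (sample 5 is left out because its normal adjacent value is garbled in the source). The simple "cancer greater than normal" criterion is an assumption about how overexpression was scored.

```python
# (cancer, matching normal adjacent) relative expression of Pro118,
# keyed by prostate sample number, for the rows legible in Table 15.
pairs = {
    1: (41700.7, 22242.83),
    3: (4075.6, 7066.7),
    4: (334.4, 777.2),
    6: (21474.13, 100720.8),
    7: (14849, 33717),
    8: (202.57, 146.91),
    17: (10126, 11270),
}

# Cancer-to-normal ratio for each pair; above 1 means higher in cancer.
fold_change = {n: cancer / normal for n, (cancer, normal) in pairs.items()}
overexpressed = sorted(n for n, fc in fold_change.items() if fc > 1)

print("overexpressed among legible pairs:", overexpressed)

# Percentage reported in the text: 5 of 14 matched prostate samples.
percent_reported = 100 * 5 / 14
print(f"5/14 = {percent_reported:.2f}%")
```

Among these rows only samples 1 and 8 come out higher in the cancer tissue, which agrees with the sample numbers listed in the text.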

Altogether, the high level of tissue specificity, plus
the mRNA overexpression in 35.71% of the primary prostate
matching samples tested, is indicative of Pro118 being a
diagnostic marker for prostate cancer.

What is claimed is:

1. A method for diagnosing the presence of prostate
cancer in a patient comprising:

(a) determining levels of CSG in cells, tissues or bodily
fluids in a patient; and

(b) comparing the determined levels of CSG with levels
of CSG in cells, tissues or bodily fluids from a normal human
control, wherein a change in determined levels of CSG in said
patient versus normal human control is associated with the
presence of prostate cancer.

2. A method of diagnosing metastases of prostate cancer 
in a patient comprising: 

(a) identifying a patient having prostate cancer that is 
not known to have metastasized; 
(b) determining CSG levels in a sample of cells, tissues,
or bodily fluid from said patient; and

(c) comparing the determined CSG levels with levels of 
CSG in cells, tissue, or bodily fluid of a normal human 
control, wherein an increase in determined CSG levels in the 
patient versus the normal human control is associated with a 
cancer which has metastasized. 

3. A method of staging prostate cancer in a patient 
having prostate cancer comprising: 

(a) identifying a patient having prostate cancer; 

(b) determining CSG levels in a sample of cells, tissue, 
or bodily fluid from said patient; and 

(c) comparing determined CSG levels with levels of CSG 
in cells, tissues, or bodily fluid of a normal human control, 
wherein an increase in determined CSG levels in said patient 
versus the normal human control is associated with a cancer 
which is progressing and a decrease in the determined CSG 
levels is associated with a cancer which is regressing or in 
remission.

4. A method of monitoring prostate cancer in a patient
for the onset of metastasis comprising:

(a) identifying a patient having prostate cancer that is
not known to have metastasized;

(b) periodically determining levels of CSG in samples of
cells, tissues, or bodily fluid from said patient; and

(c) comparing the periodically determined CSG levels with
levels of CSG in cells, tissues, or bodily fluid of a normal
human control, wherein an increase in any one of the
periodically determined CSG levels in the patient versus the
normal human control is associated with a cancer which has
metastasized.

5. A method of monitoring a change in stage of prostate
cancer in a patient comprising:

(a) identifying a patient having prostate cancer;

(b) periodically determining levels of CSG in cells,
tissues, or bodily fluid from said patient; and

(c) comparing the periodically determined CSG levels with
levels of CSG in cells, tissues, or bodily fluid of a normal
human control, wherein an increase in any one of the
periodically determined CSG levels in the patient versus the
normal human control is associated with a cancer which is
progressing in stage and a decrease is associated with a
cancer which is regressing in stage or in remission.

6. A method of identifying potential therapeutic agents
for use in imaging and treating prostate cancer comprising
screening molecules for an ability to bind to CSG wherein the
ability of a molecule to bind to CSG is indicative of the
molecule being useful in imaging and treating prostate cancer.

7. The method of claim 1, 2, 3, 4, 5 or 6 wherein the
CSG comprises SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12,
13, 14, 15, 16, 17, 18, 19 or 20 or a polypeptide encoded
thereby.

8. An antibody which specifically binds CSG. 

9. A method of imaging prostate cancer in a patient
comprising administering to the patient an antibody of claim
8.

10. The method of claim 9 wherein said antibody is 
labeled with paramagnetic ions or a radioisotope. 

11. A method of treating prostate cancer in a patient
comprising administering to the patient an antibody of claim
8.

12. The method of claim 11 wherein the antibody is
conjugated to a cytotoxic agent.

<210> 1

<211> 188 

<212> DNA 

<213> Homo sapiens 

<400> 1 

ggtaaacacc tgcttttatc atcagaacaa agaggctgtg tcccctgccc tatgaggtcc 60 

atttctgaga gttgtggcta atgggcaaga aggttggggc tttagagatt tgggataaag 120 

atatcaaaca ccagaaaggt agaaagaagt gatcagatta gggttactta ggtgatgata 180
tgaactct 188 

<210> 2 

<211> 9819 

<212> DNA 

<213> Homo sapiens 

<400> 2 

cagctggggt ctacccaggt ccatgtcttg gacatgttga gagtttttct ggaaggcagg 60 
gatacagtgt ggtccaaaaa cacacaaatg cccctactgg cccaggggtt gtcacaatag 120
actggaaggg tgacacatcc caggcgcttg ccacccatca cacgcacctc ctacccactg 180 
gcatccttcc accccaggca cacacaaagc ctcagtccag agatcaactc tggactcagc 240 
tctgaatttg catatcctgt gtgtagattc attcttcata acctctgccc agcctagctt 300 
gtgtatcatt tttttttctc tattagggga ggagcccgtc ctggcactcc cattggcctg 360 
tagattcacc tcccctgggc agggccccag gacccaggat aatatctgtg cctcctgccc 420 
agaaccctcc aagcagacac aatggtaaga atggtgcctg tcctgctgtc tctgctgctg 480 
cttctgggtc ctgctgtccc ccaggagaac caagatggtg agtggggaaa gcaagggatg 540
ggtgccggag aggactggaa ggaggtgagg aacaggacat gtggctggga gacaggctgg 600
atgcagctgg gataccctgg catacggcag gaatgggtgc ccaaggctgt caactccctc 660 
agctcacaca cttccaggag cattcaggga gcctctgcgc tggcccgaaa taagaccctc 720 
aggaatctga atctaaaacc cctagtttac agtgaaaaca aagactccaa agaccaagcg 780 
acctgcttgg ggtagacagt caggacggag taggaaccat atgcctggag ctgcttctgc 840
tcctgttcct tccctccttc cgatggctgg gtacacctgc ctgacgctga ggaaaagaga 900 
gagcagcccc aaggggaaag tgggaaggca ggttggctgg agggatggtg ctagaaggaa 960 
acccgtgccc aaaccccaca ctcagacacc actgcagtgg gtctggaagg cgagtggctg 1020 
gaagagaaga gagtgggagc tccgggagat caagagtcac tcctaggata agggaaggag 1080 
gctgtttgtg gcatgagaat gtgcaggata aagacacgga agcgaatggc ttctcagttg 1140
tgtgagttta aaattcatga catttacaaa ttgtcagaaa aggtgttata tgtttgttat 1200 
ataacaatca ctttggaatg ttaatctgat tctgtgccaa aatctgaatt actcagggtt 1260 
ctccagagaa acagaactaa taggtggtac acatatacat atatatgtac gtacacatac 1320 
atacatacac tgtatacaca tggatacaca cacacatagg aagagattta catatatgta 1380
tacaaaagag agagagagta gagatttatt ttaagaaatt gactcacacn attgggagga 1440
gtaacaagtc ctaaatcttc agagccggcc agcaggctgg agacccaggg aagagttgat 1500
gtcttagtct tgattccaag ggcagactgt aggcagaatt ctttcctctt taggggacat 1560 
ctgaggcttt ttctcttaag gccttcaact gattggatga agcccaccac catggagagc 1620 
aatccacttt actcaaggtc tactgatttt tttgtaaatt aaaaaaaaaa ctgtgggtgc 1680 
atagtatgtg tatatattta tggggtacat gagaggtntt gattcaggca tgcaatgtga 1740 
aataatcaca tcatcaaaaa tgaggtatcc atcccttcaa gctttcatcg tttgtgttac 1800 
agacaatcca attatacttt tttggttatt ttagttttta aaagtatttg attatttatt 1860
tatttattta tttttgagac agagtctcac tctgtcaccc aggcaggagt gcagcggcar 1920 
gatctcggct cactgcaacc cccgcctccc aggttcaagc aactttcctg cctcagtctc 1980 
ctgagtagct aggactacag gcacctgcca ccacacctgg ctaatttttt tgtattttta 2040 
gtagagacgg tttcatcatg ttggccaggc tagtcttgat accctgacct cgtgatctgc 2100 
ccgccttggt ctcccaaagc gccgggatta caggtgtcag caactgcgcc tggcctctct 2160 
tttggttatt taaaagtgta caattaaatt atgattatta ttattatttt tgagatggat 2220 
tcttgttctg tcacccaggc tggagtgcag tggcgtgatc ttggcttact gcaaacctcc 2280 
gcctgttggg ttcaagcaat tatcttgcct cgggtgtaca ctgccacaca cggctaactt 2340 
atgtattttt aatagagata gggcttcacc atgttggcta gactggtctt gacctcttga 2400 
cctcaagtga tccactcact tcagcctccc agagtgctgg aattacaggc acgagccacc 2460 
acacctggcc ccagttaaat tattattgac tatagtcacc ctgttgtgct atcaaatagt 2520 
aggtcttatt cattcttctt tttttttttt tttctgtgac agagttgccc aggctggaat 2580 
gcagtggtgc aatcttggct cactgcaacc tctgcctccc gggcttaagc gattctcctg 2640 
cctcagcctt ctgagtcgct gggactacag gtgtgtgcca ccacgcccgg ctaatttatg 2700 
tatttttagt agagatgggg tttcaccatg ttggccaggc tggtttcgaa ctcctgacct 2760 
caagtgaccc acctgcctca gcttcccaaa gtgttggaat tacaggcatg agccaccaca 2820 
cctggcccca gttaaattat tattcactgg agtcactttg ttgtgctatc aaatagtttt 2880 
ctaactattt tttttgtacc cattaaccac cctcccaatt tccccccaac cctgccacta 2940 
cccttcccag cctttggtaa ccatccttct actctctatg tccatgaatt caattgtagg 3000 
gtctactgat ttaaaggcta atcacattta gacactcagg agcaagaata attttagtaa 3060 
ttgaactagg attctgccat atgacctcca acatcattag cacctgtgta aattgtatca 3120 
taaaataatt atggaactat tatggaaatg tccctctctc ccagatccca ccttgtacca 3180 
aaatgcaagg tacaaccccg ggaattctga gctccatcct agtcttaccc tgtgctaatt 3240 
cagtctgggt catttcttga attttctggt aaattctcct ttctaccctt tctaactata 3300 
tgtatttgtc aggttaagct agaagtgtta attttttttt tttttgagat ggagccttgc 3360 
tttgtcacct aggctgaagt gcagtggcat gatctcagct cactgcaagc tccgcctccc 3420
gggttcatgc cattctcctg cctcagcctc ctgagtagct gggactacag gcacccgcca 3480
ccatgcttgg ctaatttttt gaactcttag tagagacggg gtttcaccat gttagccagg 3540
atggtctcga tctcctgacc tcgtgatcca cccgcctcgg ccccctaaag tgctgggatt 3600
acaggcgtga gccactgagc ccggacgaaa tgttaatttg ttttttttga gacggagtct 3660 
cactctgtca tccaagctgg agtgcagtgg catgatcttg gcttgttgca aJfctctgcct 3720 
ctctggttca agtgattttc ctgcctcagc ctccagcatg accgggatta caggcccgca 3780 
ccaccatgcc cagctaattt ttgtattttt taatagagat ggggtttcac catgttggcc 3840
aggctggtct tcaacccctg atctcaagta atctgcctgc cttggccccc caaagtcctg 3900
ggattacagg catgagccac ggagcccagc ctagaaatgt taatttctaa cgcatgtcag 3960
attccatgca cactgggcaa ggttccattc ctccatgggg tgactcaggg atccaggcca 4020 
attgcatatt gagactcttt catattatcc tgtggccttc aaagccgtca cctctaggga 4080 
tgagaaacaa aagggaaagc cagctggtag ggtcttggac aagaagaaag acatcacttc 4140
tgctcacatt ctcttttgac aaaactcagt cacatggtcc caatatatct tcgaggtggc 4200 
tgagtaatgt tatcttccta tgtgtcaagc agaggaaata atgtagtgaa gacacaggat 4260 
ggtctctgaa atatcatctc aggcatgaaa gtagagcata ttcacttgag tgagcctcca 4320 
gtggtgtgaa gttgatggca ggagaaagag ctggggaaga aaaggccagt ggcaggcctc 4380 
ccctcctagc cctatgcagc cccacagtgg gacccttgca tggacctcaa ccatcagaat 4440 
cttttctttt gcaggtcgtt actctctgac ctatatctac actgggctgt ccaagcatgt 4500 
tgaagacgtc cccgcgtttc aggcccttgg ctcactcaat gacctccagt tctttagata 4560
caacagtaaa gacaggaagt ctcagcccat gggactctgg agacaggtgg aaggaatgga 4620 
ggattggaag caggacagcc aacttcagaa ggccagggag gacatcttta tggagaccct 4680 
gaaagacatt gtggagtatt acaacgacag taacggtcag tgaataacag accacagggg 4740 
tggaaggtct aacccaagag gcagcccccc cagtgtgagt ggcaagggat cagcaggatg 4800
gaaatagtcc caatcccagg ggaagaacag gagacacagc agaaacacag acatgtccgc 4860
atcccaccca ccccacagca caggtgctcc ccgcttcccc atcaattgcc ccatccccat 4920 
cccaggcctc aggtcacaca ggaagtgatg gcagagtcac ttcctatcca ggcacctatg 4980 
acctctcacc tccacacccc acccatcgga ggctgatacc cccgtgagaa ggcatcagac 5040 
tcacccctgt ccagggaggt tgcctggaga gtgagccact ctcaaagtca ctcagacctg 5100 
ggctcacctg gtggttctgc cagtcctagc tgttgacagt gaaacgttcc caaaatatct 5160 
ggttgaaatc tgcaaacatt ggagcactga gacctacctc caaacaagcc tgtaatattt 5220 
aaccatgtct gttctatgaa ggatgtcaca gtctgtcctg atctcccttg cagctccatc 5280 
acctagcaca gggtacagcc aatattggct caattgaaat ttgtggaatc cacagagaaa 5340 
agcacccggc acacaccgta gcccatgctg ggggctcagg aagtgctgga ttcaaaactg 5400 
tgggctgtta gagttccttg gagccctaaa gttcctcctt accatacgat gcagacccag 5460 
gaagggccac ctgcgctatg gtcagaggag ctggtggcag agcccgtgca gagatggtcc 5520 
ctgtgccccc ggcccagtgc tctttctcct aaaccacact gccagcccca aggcagccaa 5580 
cctcaggtct ggtgaactgc tggtgttaaa ttatcataga gtgggtgtca aaagatgggc 5640 
tactaagtac aaaaatgccc aaggtgctac atgggacctg aagattttca aaaggaggca 5700 
agaaagagat aggcagatgt ttcaaggatg tggggtgggg gaggtcttgg taaggaaaat 5760 
ggcccaggct gtgtgtcagc aataggagag gagggggcac aggtgatcag aaaagacact 5820 
gggggaagca ttgatggaca ggaatagaaa tggcaaagtg gataattaag aggaaggagg 5880 
atgaggagat gaacacaggg tattagaaaa taatagaagg cagggcttgg tggctcactc 5940 
ttgtaatccc agcactttgg gaggctgagg caggcagatc acctaaggtc aggagctcga 6000 
gaccagcccg gccaacatgg tgaaaccctg tctctactaa taatacaaaa atagcctggc 6060 
atggtggcac acgtctgtgg tcccagctac tcaggaggct gaggcaggag aattgcttga 6120 
acccaggagg cagaggttac agtggccaaa atcctaccat tgcactacag cctgggtgac 6180 
aagagtgaaa cgttgtctaa aaacaaaaaa caaaaaacaa aaaaaggaaa taatagtagc 6240 
tgacattcac tgagcactta ctttgtgcca ggcccatcta tgagcatata taatgctcag 6300
aatagccccc taaaacagtg ctcttggcat tgccatttca gaggtgagga aatagaggca 6360
cagggagttg agtggctcca gttcaggcaa cacaccaggt gggggtgggg ggctggggag 6420
agacctggga cgtgagccca gacagcttga gagctttcag agtctatgcc aacagcacca 6480
accagtgctg ggtaaacacc tgcttttatc atcagaacaa agaggctgtg tcccctgccc 6540
tatgaggtcc atttctgaga gttgtggcta atgggcaaga aggttggggc tttagagatt 6600
tgggataaag atatcaaaca ccagaaaggt agaaagaagt gatcagatta gggttactta 6660 
ggtgatgata tgaactcttc ctagaaccga gagaaaaaga gagccttccc ttactcatat 6720 
gaaatcacaa ataatttcta tccaatttgg aagtacactt tggcgtagtt gtgacagctt 6780 
cctcaggact cagcataaat tcaaacaaat aattgtcctt agaagagatg ctatagaaga 6840 
gatagaaata tattcatatt ctgtagcttt tttttttttg agatggagtt ttgctcttgt 6900 
cacccaagct ggagtgcagt gatgcaatct cagctcactg caaactttgc ctcctgggtt 6960 
caagggattc tcctgcctca gcctcccgat aactgggact acaggctaca ggcatgtgtc 7020 
actactcctg gttaattttt tttttttttt tttaagactg agtcttgctc tgtctttcag 7080 
gctgatgtac aatggctcca tctcggctca ctacaacttc tgtcccccag gttcaagcga 7140 
ttctcctgcc tcagcctcat gagtagctgg gattacaggc atgtgccagc acacccagca 7200 
aatttttgta tttttagtag agatgaggtc ttaccatgtt ggccaggctg gtctcaaact 7260 
cctgacctca ggtgatcctt tggcctcagc ctccctaact gctgggatta caggcatgag 7320 
ccactgcgtc cagcctaatt ttatattttt ggtagagatg gggtttcacc atattggcca 7380
ggctggtctc gaactcatga cctaaggtga tccatcctcc tcagcctctc aaagtgctgg 7440 
gattacaagt gtgagccact gggcccggtg cttttttttt tttttttttt tttttttttt 7500 
tgagataggg tctcactctg tcacccaggc tgaaatgcag tagtgtgatt ttggctcatt 7560 
gcagccttga cttcccaggc tgaagtgatc ctcccacctc agcctcctga gtagctgggg 7620 
ctacaggcat gcaccaccat gctgcgctaa tttttatatt ttttgtagtg gtgggatttc 7680 
gccatatcac cctggctggt ctggaacccc tgggctcaag cgatccactc gcttcagctt 7740 
ctcaaagtgc tgggattaca ggcatgagcc acagcgccca ggctgtagct ctcttaagga 7800 
ggaacatatc tcatctgaga caaacctgaa atgccaaacc aaactgagtt agcccctctc 7860 
tgtctgttgt atatattgga gtaataacct atttgtcttg ataaagggat tgcatgcttg 7920 
aattgcaaaa acctttattt cttttgggtt gcccaatgtg caagactaag agttattttg 7980 
ataaatttct caccaggctg actgtctctc tgtggggtcg ggggagtttt cagggtctca 8040 
cgtattgcag ggaaggtttg gttgtgagat cgagaataac agaagcagcg gagcattctg 8100 
gaaatattac tatgatggaa aggactacat tgaattcaac aaagaaatcc cagcctgggt 8160 
ccccttcgac ccagcagccc agataaccaa gcagaagtgg gaggcagaac cagtctacgt 8220 
gcagcgggcc aaggcttacc tggaggagga gtgccctgcg actctgcgga aatacctgaa 8280 
atacagcaaa aatatcctgg accggcaagg tactcactgc ttcctgctcc ccagtactga 8340 
gcccagaata aaagacgatc tcaggctagg agctcaggca acatcttagt ccggtctcat 8400 
ctgttcctgg atgtccctca gacccccagc tttcatcttt taggatttat tccttccctg 8460
ggataatata atttgtggtc caaaaagaac atcatcaaaa tttcaggcag aatgggccag 8520 
gaaggccatt ctttcttgat gagtgtcccc aaatcatctc caattaacag acaaggagct 8580 
tgaggttagg gaggtgaggg taacactgtc tgtaagaggc agagctggga ctcaaattcc 8640 
agatttcaga ttccaaatcc catcgttttt tatctctaca atgatgcctc ccatctgggt 8700
ggtggagaga agggaggcgt gtaaaagtca gccccagaag gacaagagca agccagtgtg 8760 
agcggaattg atggctgcaa gctgagactt ggattggaga cgtagtgaga ctcaggattg 8820 
tgcagtgctg cagggaagtg gttgctggat agaggcatgg gctgaaccaa gcagctggac 8880 
tgagactggg ggacagaact ccaaagccca ctgagatgtg ggaaaacatg gagaagcaca 8940
cggagcattc acaacttatt gccgtcagag tcaatacatg ggtgaggtgg ggattgggca 9000 
agagggaaag cgtcagcctt ccctgatatt ctggaaagtc tcccggggct gggggtgggc 9060 
aggtacagag cttcgagctc tgctgatcgc tgacatccag gggtgggggt aggaagagac 9120
ctgggccggg agaagtccac ctcaagcctg cagtgtcaca ctctatccct ccacagatcc 9180
tccctctgtg gtggtcacca gccaccaggc cccaggagaa aagaagaaac tgaagcgcct 9240
ggcctacgac ttctacccag ggaaaactga cgtgcactgg actcgggccg gcgaggtgca 9300
ggagcctgag ttacggggag atgttcttca caatggaaat ggcacttacc agtcctgggt 9360 
ggtggtggca gtgcccccgc aggacacagc cccctactcc cgccacgtgc agcacagcag 9420 
cctggcccag cccctcgtgg tgccctggga ggccagctag gaagcaaggg ttggaggcaa 9480
tgtgggatct cagacccagt agctgccctt cctgcctgat gtgggagctg aaccacagaa 9540 
atcacagtca atggatccac aaggcctgag gagcagtgtg gggggacaga caggaggcgg 9600 
atttggagac cgaagactgg gatgcctgtc ttgagtagac ttggacccaa aaaatcatct 9660 
caccttgagc ccacccccac cccattgtct aatctgtaga agctaataaa taatcatccc 9720 
tccttgccta gcataacaga gaatcctttt tttaacggtg atgcgctgta gaaatgtgac 9780 
tagattttct cattggttct gccctcaagc actgaattc 9819 

<210> 3 

<211> 250 

<212> DNA 

<213> Homo sapiens 

<400> 3 

cgcccctgcg ccgccgagcc agctgccaga atgccgaact ggggaggagg caagaaatgt 60 
ggggtgtgtc agaagacggt ttactttgcc gaagaggctc agtgcgaagg caacagcttc 120 
cataaatcct gcttcctgtg catggtctgc aagaagaatc tggacagtac cactgtggcc 180 
gcgcatggtg aggagattta ctgcaagtcc tgctacggca agaagtatgg gcccaaaggc 240 
tatggctacg 250 

<210> 4 

<211> 1900 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> unsure 
<222> (16) 

<220> 

<221> unsure 
<222> (18) 

<220> 

<221> unsure 
<222> (20)

<220> 

<221> unsure 
<222> (1887) 

<220> 

<221> unsure 
<222> (1894)

<400> 4

acgccttccg cggagnanan caaaacggcg cgcaggccgg gcgcacccag ccgccacttc 60 
cgagagcgcc tgccgcccct ggcgccgccg agccagctgc cagaatgccg aactggggag 120 
gaggcaagaa atgtggggtg tgtcaagaag acggtttact ttgccgaaga ggtTcagtgc 180 
gaaggcaaca gcttccataa atcctgcttc ctgtgcatgg tctgcaagaa gaatctggac 240 
agtaccactg tgggccgtgc atggtgagga gatttactgg caagtccctg ctacggcaag 300
aagtatgggc ccaaaggcta tggctacggg ccagggcgca ggcaccctca gcaccgacaa 360
gggggagtcg ctgggtatca agcacgagga agcccctggg ccacaggccc accaccaacc 420 
ccaatggcat ccaaatttgc ccagaagact ggtggctccg agcgctgccc ccgatgcagc 480
caggcagtct atgctgcgga gaaggtgatt ggtgctggga agtcctggca taaggcctgc 540
tttcgatgtg ccaagtgtgg caaaggcctt gagtcaacca ccccgggcag acaaggacgg 600 
cgagatttac tgcaaaggat gttatgctaa aaacttcggg cccaagggct ttggttttgg 660 
gcaaggagct ggggccttgg tccactctga gtgaggccac caccacccac cacaccctgc 720 
ccactcctgc gcttttcatc gccattccat tcccagcagc tttggagacc tccaggatta 780 
tttctctgtc agccctgcca catatcacta atgacttgaa cttgggcatc tggctccctt 840 
tggtttgggg gtctgcctga ggtcccaccc cactaaaggg ctccccaggc ctgggatccg 900
acaccatcac cagtaggaga cctcagtgtt ttgggtctag gtgagagcag gcccctctcc 960 
ccacacctcg ccccacagag ctctgttctt agcctcctgt gctgcgtgtc catcatcagc 1020 
tgaccaagac acctgaggac acatcttggc acccagagga gcagcagcaa caggctggag 1080 
ggagagggaa gcaagaccaa gatgaggagg ggggaaggct gggttttttg gatctcagag 1140
attctcctct gtgggaaaga ggttgagctt cctggtgtcc ctcagagcaa gcctgaggag 1200 
tcccagctta gggagttcac tattggaggc agagaggcat gcaggcaggg tcctaggagc 1260 
ccctgcttct ccaggcctct tgcccttgag tctttgtgga atggatagcc tcccactagg 1320 
actgggagga gaataaccca ggtcttaagg accccaaagt caggatgttg tttgatcttc 1380 
tcaaacatct agttccctgc ttgatgggag gatcctaatg aaatacctga aacatatatt 1440 
ggcatctatc aatggctcaa atcttcattt atctctggcc ttaaccctgg ctcctgaggc 1500
tgcggccagc agagcccagg ccagggctct gttcttgcca cacctgcttg atcctcagat 1560 
gcggagggag gtaggcactg cctcagtcct catccaaaca cctttccctt tgccctgaga 1620 
cctcagaatc ttccctttaa cccaagaccc tgcctcttcc actccaccct tctccaggga 1680 
cccttagatc acatcactcc acccctgcca ggccccaggc caggaatagt ggcgggagga 1740 
aggggaaagg gctgggcctc accgctccca gcaactgaaa ggacaacact atctggagcc 1800 
acccactgaa agggctgcag gcacgggctg tacccaagct gacttctcat ctggtcaaca 1860 
aagctgttta gaccagaaaa aaaaaanaaa aaanaaaagg 1900 

<210> 5 

<211> 273 

<212> DNA 

<213> Homo sapiens 

<400> 5 

gatgcatcaa aagagctgca agttctccac attgacttct tgaatcagga caacgccgtt 60 
tctcaccaca catgggagtt ccaaacgagc agtcctgtgt cccggcgagg acaggtgttt 120 
cacctgcggc tggtgctgaa ccagccccta caatcctacc accaactgaa actggaatcc 180 
agcacagggc cgaatcctag caccgccaaa cacaccctgg tggtgctcga cccgaggacg 240 
ccctcagacc actacaactg gcaggcaacc ctt 273 

<210> 6

<211> 3021

<212> DNA 

<213> Homo sapiens 

<400> 6
tgtggaagca ccaggcatca gagatagagt cttccctggc attgcaggag agaatccgaa 60 
gggatgatgg atgcatcaaa agagctgcaa gttctccaca ttgacttctt gaatcaggac 120 
aacgccgttt ctcaccacac atgggagttc caaacgagca gtcctgtgtt ccggcgagga 180 
caggtgtttc acctgcggct ggtgctgaac cagcccctac aatcctacca ccaactgaaa 240 
ctggaattca gcacagggcc gaatcctagc accgccaaac acaccctggt ggtgctcgac 300 
ccgaggacgc cctcagacca ctacaactgg caggcaaccc ttcaaaatga gtctggcaaa 360 
gaggtcacag tggctgtcac cagttccccc aatgccatcc tgggcaagta ccaactaaac 420 
gtgaaaactg gaaaccacat ccttaagtct gaagaaaaca tcctatacct tctcttcaac 480
ccatggtgta aagaggacat ggttttcatg cctgatgagg acgagcgcaa agagtacatc 540
ctcaatgaca cgggctgcca ttacgtgggg gctgccagaa gtatcaaatg caaaccctgg 600 
aactttggtc agtttgagaa aaatgtcctg gactgctgca tttccctgct gactgagagc 660 
tccctcaagc ccacagatag gagggacccc gtgctggtgt gcagggccat gtgtgctatg 720 
atgagctttg agaaaggcca gggcgtgctc attgggaatt ggactgggga ctatgaaggt 780 
ggcacagccc catacaagtg gacaggcagt gccccgatcc tgcagcagta ctacaacacg 840 
aagcaggctg tgtgctttgg ccagtgctgg gtgtttgctg ggatcctgac tacagtgctg 900 
agagcgttgg gcatcccagc acgcagtgtg acaggcttcg attcagctca cgacacagaa 960 
aggaacctca cggtggacac ctatgtgaat gagaatggca agaaaatcac cagtatgacc 1020 
cacgaccctg tctggaattt ccatgtgtgg acggatgcct ggatgaagcg accggatctg 1080 
cccaagggct acgacggctg gcaggctgtg gacgcaacgc cgcaggagcg aagccagggt 1140
gtcttctgct gtgggccatc accactgacc gccatccgca aaggtgacat ctttattgtc 1200 
tatgacacca gattcgtctt ctcagaagtg aatggtgaca ggctcatctg gttggtgaag 1260 
atggtgaatg ggcaggagga gttacacgta atttcaatgg agaccacaag catcgggaaa 1320 
aacatcagca ccaaggcagt gggccaagac aggcggagag atatcaccta tgagtacaag 1380
tatccagaag gctcctctga ggagaggcag gttcatggat catgccttcc tccttctcag 1440 
ttctgagagg gagcacagac gacctgtaaa agagaacttt cttcacatgt cggtacaatc 1500 
agacgatgtg ctgctgggaa actctgttaa tttcaccgtg attcttaaaa ggaagaccgc 1560 
tgccctacag aatgtcaaca tcttgggctc ctttgaacta cagttgtaca ctggcaagaa 1620 
gatggcaaaa ctgtgtgacc tcaataagac ctcgcagatc caaggtcaag catcagaagt 1680 
gactctgacc ttggactcca agacctacat caacagcctg gctatattag atgatgagcc 1740 
agttatcaga ggtttcatca ttgcggaaat tgtggagtct aaggaaatca tggcctctga 1800 
agtattcacg tctttccagt accctgagtt ctctatagag ttgcctaaca caggcagaat 1860 
tggccagcta cttgtctgca attgtatctt caagaatacc ctggccatcc ccttgactga 1920 
cgtcaagttc tccttggaaa gcctgggcat ctcctcacta cagacctctg accatgggtg 1980 
agtctgcctg aggacggtgc agcctggtga gaccatccaa tcccaaataa aatgcacccc 2040
aataaaaatg gacccaagaa atttatcgtc aagttaagtt ccaaacaagt gaaagagatt 2100 
aatgctcaga agattgttct catcaccaag tagccttgtc tgatgctgtg gagccttagt 2160 
tgagatttca gcatttccta ccttgtggct tagctttcag attatggatg attaaatttg 2220 
atgacttata tgagggcaga ttcaagagcc agcaggtcaa aaaggccaac acaaccataa 2280 
gcagccagac ccacaaggcc aggtcctgtg ctatcacagg gtcaccttct tttacagtta 2340 
gaaacaccag ccgaggccac agaatcccat ccctttcctg agtcatggcc tcaaaaatca 2400 
gggccaccat tgtctcaatt caaatccata gatttcgaag ccacagattc tctccctgga 2460 
gcaagcacga ctatgggcag cccagtgctg ccacctgctg acgacccttg agaagccgcc 2520 
atatcttcag gccatgggtt caccagccct gaaggcacct gtcaactgga gtgctctctc 2580 
agcactggga tgggcctgat agaagtgcat tctcctccta ttgcctccat tctcctctct 2640
ctatccctga aatccaggaa gtccctctcc tggtgctcca agcagtttga agcccaatcc 2700
gcaaggacat ttctcaaggg ccatgtggtt ttgcagacaa ccctgtcctc aggcctgaac 2760
tcaccataga gacccatgtc agcaaacggt gaccagcaaa tcctcttccc ttattctaaa 2820
gctgcccctt gggagacccc agggagaagg cattgcttcc tccctggtgt gaactctttc 2880
tttggtattc catccactat cctggcaact caaggctgct tctgttaact gaagcctgct 2940
ccttcttgtt ctgccctcca gagatttgct caaatgatca ataagcttta aattaaactc 3000
tactccaaga aaaaaaaacc g 3021



<210> 7 

<211> 267 

<212> DNA

<213> Homo sapiens 



<400> 7 

gaacattcca gatacctatc actactcgat gctgttgata acagcaagat ggctttgaac 60
tcagggtcac caccagctat tggaccttac tatgaaaacc atggatacca accggaaaac 120
ccctatcccg cacagcccac tgtggtcccc actgtctacg aggtgcatcc ggctcagtac 180
tacccgcccc ccgtgcccca gtacgccccg agggtcctga cgcaggcttc caaccccgtc 240
gtctgcacgc agcccaaatc cccatcc 267



<210> 8 

<211> 3443 

<212> DNA

<213> Homo sapiens 



<400> 8 

gggcgggccg ggccgagtag gcgcgagcta agcaggaggc ggaggcggag gcggagggcg 60
aggggcgggg agcgccgccc ggagcgcggc aggtcatatt gaacattcca gatacctatc 120
attactcgat gctgttgata acagcaagat ggctttgaac tcagggtcac caccagctat 180
tggaccttac tatgaaaacc atggatacca accggaaaac ccctatcccg cacagcccac 240
tgtggtcccc actgtctacg aggtgcatcc ggctcagtac tacccgtccc ccgtgcccca 300
gtacgccccg agggtcctga cgcaggcttc caaccccgtc gcctgcacgc agcccaaatc 360
cccatccggg acagtgtgca cctcaaagac taagaaagca ctgtgcatca ccttgaccct 420
ggggaccttc ctcgtgggag ctgcgctggc cgctggccta ctctggaagt tcatgggcag 480
caagtgctcc aactctggga tagagtgcga ctcctcaggt acctgcatca acccctctaa 540
ctggtgtgat ggcgtgtcac actgccccgg cggggaggac gagaatcggt gtgttcgcct 600
ctacggacca aacttcatcc ttcaggtgta ctcatctcag aggaagtcct ggcaccctgt 660
gtgccaagac gactggaacg agaactacgg gcgggcggcc tgcagggaca tgggctataa 720
gaataatttt tactctagcc aaggaatagt ggatgacagc ggatccacca gctttatgaa 780
actgaacaca agtgccggca atgtcgatat ctataaaaaa ctgtaccaca gcgatgcctg 840
ttcttcaaaa gcagtggttt ctttacgctg tatagcctgc ggggtcaact tgaactcaag 900
ccgccagagc aggatcgtgg gcggcgagag cgcgctcccg ggggcctggc cctgggcagg 960
tcagcctgca cgtccagaac gtccacgtgt gcggaggctc catcatcacc cccgagtgga 1020
tcgtgacagc cgcccactgc gtggaaaaac ctcttaacaa tccatggcat tggacggcat 1080
ttgcggggat tttgagacaa tctttcatgt tctatggagc cggataccaa gtagaaaaag 1140
tgatttctca tccaaattat gactccaaga ccaagaacaa tgacattgcg ctgatgaagc 1200
tgcagaagcc tctgactttc aacgacctag tgaaaccagt gtgtctgccc aacccaggca 1260
tgatgctgca gccagaacag ctctgctgga tttccgggtg gggggccacc gaggagaaag 1320
ggaagacctc agaagtgctg aacgctgcca aggtgcttct cattgagaca cagagatgca 1380 
acagcagata tgtctatgac aacctgatca caccagccat gatctgtgcc ggctccctgc 1440 
aggggaacgc cgattcttgc cagggtgaca gtggagggcc tctggtcact tcgaagaaca 1500 
atatctggtg gctgataggg gatacaagct ggggttctgg ctgtgccaaa gcttacagac 1560 
caggagtgta cgggaatgtg atggtattca cggactggat ttatcgacaa atgagggcag 1620
acggctaatc cacatggtct tcgtccttga cgtcgtttta caagaaaaca atggggccgg 1680 
ttttgcttcc ccgtgcatga tttactctta gagatgattc agaggtcact tcatttttac 1740 
taaacagcga acttgtctgg ctttggcact ctctgccatt ctgtgcaggc tgcagtggct 1800 
cccctgccca gcctgctctc cctaacccct tgtccgcaag gggtgatggc cggctggttg 1860 
tgggcactgg cggtcaagtg tggaggagag gggtggaggc tgccccattg agatcttcct 1920 
gctgagtcct ttccaggggc caattttgga tgagcatgga gctgtcacct ctcagctgct 1980
ggatgacttg agatgaaaaa ggagagacat ggaaagggag acagccaggt ggcacctgca 2040 
gcggctgcct ctggggccac ttggtagcgt ccccagccta cctctccaca aggggatttt 2100 
gctgatgggc tcttagagcc ttagcagccc tggatggtgg ccagaaataa agggaccagc 2160 
ccttcatggg tggtgacgtg gtagtcacct tgtaagggga acagaaacat ttttgttctt 2220 
atggggtgag aatatagaca gtgcccttgg gtgcgaggga agcaattgaa aaggaacttg 2280 
ccctgagcac tcctggtgca ggtctccacc tgcacattgg gtggggctcc tgggagggag 2340 
actcagcctt cctcctcatc ctccctgacc ctgctcccag caccctggag agtgcacatg 2400 
ccccttggtc ctgggcaggg gcgccaagtc tggcaccatg ttggcctctt caggcctgcc 2460 
agtcactgga aattgaggtc catgggggaa atcaaggatg ctcagcttaa ggtacactgt 2520 
ttccatgtta tgtttctaca cattgctacc tcagtgctcc tggaaactta gcttttgatg 2580 
tctccaagta gtccaccttc atttaactct ttgaaactgt atcatctttg ccaagtaaga 2640 
gtggtggcct atttcagctg ctttgacaaa atgactggct cctgacttaa cgttctataa 2700 
atgaatgtgc tgaagcaaag tgcccatggt ggcggcgaag aagagaaaga tgtgtttcgt 2760 
tttggaccct ctgtggtccc ttccaatgct gtgggtttcc aaccagggga agggtccctt 2820 
ttgcattgcc aagtgccata accatgagca ctactctacc atggttctgc ctcctggcca 2880 
agcaggctgg tttgcaagaa tgaaatgaat gattctacag ctaggactta accttgaaat 2940 
ggaaagtctt gcaatcccat ttgcaggatc cgtctgtgca catgcctctg tagagagcag 3000 
cattcccagg gaccttggaa acagttggca ctgtaaggtg cttgctcccc aagacacatc 3060
ctaaaaggtg ttgtaatggt gaaaacgtct tccttcttta ttgccccttc ttatttatgt 3120 
gaacaactgt ttgtcttttt ttgtatcttt cttaaactgt aaagttcaat tgtgaaaacg 3180 
aatatcatgc aaataaatta tgcgactttt ttttcaaagt aaccactgca tctttgaagt 3240 
tctgcctggt gagtaggacc agcctccatt tccttataag ggggtgatgt tgaggctgct 3300
ggtcagagga ccaaaggtga ggcaaggcca gacttggtgc tcctgtggtt ggtgccctca 3360 
gttcctgcag cctgtcctgt tggagaggtc cctcaaatga ctccttctta ttattctatt 3420 
agtctgtttc catgggcgtg ata 3443 

<210> 9
<211> 254 
<212> DNA 

<213> Homo sapiens 



<400> 9 

gtgctgcacc aggccaccat cctgcccaag actgggacag tgtccctgga ggtacggctc 60
ctggaggcct cccgtgcctt cgaggtgtca gagaacggca acctggtagt gagtgggaag 120
gtgtaccagt gggatgaccc tgaccccagg ctcttcgacc acccggaaag ccccaccccc 180
aaccccacgg agcccctctt cctggcccag gctgaagttt acaaggagct gcgtctgcgt 240
ggctacgact acgg 254

<210> 10 
<211> 8470 

<212> DNA
<213> Homo sapiens 

<220> 

<221> unsure 
<222> (4131) 

<220> 

<221> unsure 
<222> (5117) 

<220> 

<221> unsure 
<222> (5552) 

<400> 10 

cggccgtcga cacggcagcg gccccggcct ccctctccgc cgcgcttcag cctcccgctc 60 
cgccgcgctc cagcctcgct ctccgccgcc cgcaccgccg cccgcgccct caccagagca 120
gccatggagg aggtggtgat tgccggcatg tccgggaagc tgccagagtc ggagaacttg 180 
caggagttct gggacaacct catcggcggt gtggacatgg tcacggacga tgaccgtcgc 240 
tggaaggcgg ggctctacgg cctgccccgg cggtccggca agctgaagga cctgtctagg 300
tttgatgcct ccttcttcgg agtccacccc aagcaggcac acacgatgga ccctcagctg 360
cggctgctgc tggaagtcac ctatgaagcc atcgtggacg gaggcatcaa cccagattca 420
ctccgaggaa cacacactgg cgtctgggtg ggcgtgagcg gctctgagac ctcggaggcc 480
ctgagccgag accccgagac actcgtgggc tacagcatgg tgggctgcca gcgagcgatg 540
acggccaacc ggctctcctt cttcttcgac ttcagagggc ccagcaccgc actggacaca 600 
gcctgctcct ccagcctgat ggccctgcag aacgcctacc aggccatcca cagcgggcag 660 
tgccctgccg ccatcgtggg gggcatcaat gtcctgctga agcccaacac ctccgtgcag 720 
ttcttgaggc tggggatgct cagccccgag ggcacctgca aggccttcga cacagcgggg 780 
aatgggtact gccgctcgga gggtgtggtg gccgtcctgc tgaccaagaa gtccctggcc 840 
cggcgggtgt acgccaccat cctgaacgcc ggcaccaata cagatggctt caaggagcaa 900 
ggcgtgacct tcccctcagg ggatatccag gagcagctca tccgctcgtt gtaccagtcg 960 
gccggagtgg cccctgagtc atttgaatac atcgaagccc acggcacagg caccaaggtg 1020 
ggcgaccccc aggagctgaa tggcatcacc cgagccctgt gcgccacccg ccaggagccg 1080 
ctgctcatcg gctccaccaa gtccaacatg gggcacccgg agccagcctc ggggctggca 1140 
gccctggcca aggtgctgct gtccctggag cacgggctct gggcccccaa cctgcacttc 1200 
catagcccca accctgagat cccagcgctg ttggatgggc ggctgcaggt ggtggaccag 1260 
cccctgcccg tccgtggcgg caacgtgggc atcaactcct ttggctccgg gggctccaaa 1320 
cgtgcacatc accctgaggc ccaacacgca gccgcccccc gcacccggcc cacatgccac 1380
cctgccccgt ctgctgcggg ccagcggacg cacccctgag gccgtgcaga agctgctgga 1440 
gcagggcctc cggcacagcc agggcctggc tttcctgagc atgtgaacga catcgcggct 1500 
gtccccgacc accgccatgc ccttccgtgg ctacgctgtg ctgggtggtg agacgcggtg 1560 
gcccagaggt gcagcaggtg cccgctggcg agcgcccgct ctggttcatc tgccctggga 1620 
tgggcacaca gtggcgcggg atggggctga gcctcatgcg cctggaccgc ttccgagatt 1680 
ccatcctacg ctccgatgag gctgtgaacc gattcggcct gaaggtgtca cagctgctgc 1740
tgagcacaga cgagagcacc tttgatgaca tcgtccattc gtttgtgagc ctgactgcca 1800
tccagatagg cctcatagac ctgctgagct gcatggggct gaggccagat ggcatcgtcg 1860
gccactccct gggggaggtg gcctgtggct acgccgacgg ctgcctgtcc caggaggagg 1920
ccgtcctcgc tgcctactgg aggggacagt gcatcaaaga agcccatctc ccgccgggcg 1980
ccatggcagc cgtgggcttg tcctgggagg agtgtaaaca gcgctgcccc ccggcggtgg 2040
tgcccgccgc cacaactcca aggacacagt caccatctcg ggacctcagg ccccggtgtt 2100 
tgagttcgtg gagcagctga ggaaggaggg tgtgtttgcc aaggaggtgc ggaccggcgg 2160 
tatggccttc cactcctact tcatggaggc catcgcaccc ccactgctgc aggagctcaa 2220 
gaaggtgatc cgggagccga agccacgttc agcccgctgg ctcagcacct ctatccccga 2280 
ggcccagtgg cacagcagcc tggcacgcac gtcctccgcc gagtacaatg ccaacaacct 2340 
ggtgagccct gtgctgttcc aggaggccct gtggcacgtg cctgagcacg cggtggtgct 2400 
ggagaccgcg ccccacgccc tgctgcaggc tgtcctgaag cgtggcctga agccgagctg 2460 
caccatcatc cccctgatga agaaggatca cagggacaac ctggagttct tcctggccgg 2520 
catcggcagg ctgcacctct caggcatcga cgccaacccc aatgccttgt tcccacctgt 2580 
ggagtcccca gctccccgag gaactcccct catctcccca ctcatcaagt gggaccacag 2640 
cctggcctgg gacgcgccgg ccgccgagga cttccccaac ggtccaggtt ccccctcagc 2700 
caccatctac acatgcacac caagctccga gtctcctgac cgctacctgg tggaccacac 2760 
catcgacggt cgcgtcctct tccccgccac tggctacctg agcatagtgt ggaagacgct 2820 
ggcccgaccc ctgggcctgg gcgtcgagca gctgcctgtg gtgtttgagg atgtggcgct 2880 
gcaccaggcc accatcctgc ccaagactgg gacagtgtcc ctggaggtac ggctcctgga 2940 
ggcctcccgt gccttcgagg tgtcagagaa cggcaacctg gtagtgagtg ggaaggcgta 3000 
ccagtgggat gacccngacc ccaggctctt cgaccacccg gaaagcccca cccccaaccc 3060
cacggagccc ctcttcctgg cccaggctga agtttacaag gagctgcgtc tgcgtggcta 3120 
cgactacggc cctcatttcc agggcatcct ggaggccagc ctggaaggtg actcggggag 3180 
gctgctgtgg aaggataatg ggtgagttca cggacaccat gctgcagatg tccatcctgg 3240
gtcggccaag cacggcctgt acctgcccac ccgtgtcacc gccatccaca tcgaccctgc 3300 
cacccacagg cagaagctgt acacactgca ggacaaggcc caagtggctg acgtggtggt 3360 
gagcaggtgg ctgagggtca cagtggccgg aggcgtccac atctccgggc tccacactga 3420 
gtcggccccg cggcggcagc aggagcagca ggtgcccatc ctggagaagt tttgcttcac 3480 
tccccacacg gaggaggggt gcctgtctga gcacgctgcc ctcgaggagg agctgcaact 3540 
gtgcaagggg ctggtcgagg cactcgagac caaggtgacc cagcaggggc tgaagatggt 3600
ggtgcccgga ctggatgggg cccagatccc cccgggaccc ctcacagcag gaactgcccc 3660 
ggctgttgtc ggctgcctgc aggcttcagc tcaacgggaa cctgcagctg gagctggcgc 3720 
aggtgctggc ccaggagagg cccaagctgc cagaggaccc tctgctcagc ggcctcctgg 3780
actccccggc actcaaggcc tgcctggaca ctgccgtgga gaacacgccc agcctgaaga 3840
tgaaggtggt ggaggtgctg gccggccacg gtcacctgta ttcccgcatc ccaggcctgc 3900
tcagccccca tcccctgctg cagctgagct acacggccac cgaccgccac ccccaggccc 3960
tggaggctgc ccaggccgag ctgcagcagc acgacgttgc ccagggccag cgggatcccg 4020
cagaccctgc ccccagcgcc ctgggcagcg cggacctcct ggtgtgcaac tgtgctgtgg 4080
ctgccctcgg ggacccgcct cagctctcag caacatggtg gctgccctga nagaaggggg 4140
ctttctgctc ctgcacacac tgctccgggg gcaccccctc ggggacatcg tggccttcct 4200 
cacctccact gagccgcagt atggccaggg catcctgagc caggacgcgc gggagagcct 4260 
cttctccagg gtgtcgctgc gcctggtggg cctgaagaag tccttctacg gctccacgct 4320
cttcctgtgc cgccggccca ccccgcagga cagccccatc ttcctgccgg tggacgacac 4380 
cagcttccgc tgggtggagt ctctgaaggg catcctggct gacgaagact ctttcccggc 4440
ctgtgcggct gaaggccatc aactgttcca cctcgggcgt ggtgggcttg gtgaactgtc 4500
tccgccgaga gcccggcgga acgctccggt gtgtgctgct ctccaacctc agcagcacct 4560 
cccacgtccc ggaggtggac ccgggctccg cagaactgca gaaggtgttg cagggagacc 4620
tggtgatgaa cgtctaccgc gacggggcct ggggggcttt ccgccacttc ctgctggagg 4680 
aggacaagcc tgaggagccg acggcacatg cctttgtgag caccctcacc cggggggacc 4740
tgtccctcca tccgctgggt ctgctcctcg ctgcgccatg cccagcccac ctgccctggc 4800
gcccagctct gcacggtcta ctacgcctcc ctcaacttcc gcgacatcat gctggccact 4860
ggcaagctgt cccctgatgc catcccaggg aagtggacct cccaggacag cctgctaggt 4920
atggagttct cgggccgaga cgccagcggc aagcgtgtga tgggactggt gcctgccaag 4980
ggcctggcca cctctgtcct gctgtcaccg gacttcctct gggatgtgcc ttccaactgg 5040
acgctggagg aggcggcctc ggtgcctgtc gtctacagca cggcctacta cgcgctggtg 5100 
gtgcgtgggc gggtgcnccc cggggagacg ctgctcatcc actcgggctc gggcggcgtg 5160 
ggccaggccg ccaccgccat cgccctcagt ctgggctgcc gcgtcttcac caccgtgggg 5220 
tcggctgaga agcgggcgta cctccaggcc aggctccccc agctcgacag caccagcttc 5280 
gccaactccc gggacacatc cctcgagcag catgtgctgt ggcacacggg cgggaagggc 5340 
gttgacctgg tcttgaactc cttggcggaa gagaagctgc aggccagcgt gaggtgcttg 5400
gctacgcacg gtcgcttcct ggaaattggc aaattcgacc tttctcagaa ccacccgctc 5460 
ggcatggcta tcttcctgaa gaacgtgaca ttccacgggg tcctactgga tgcgttcttc 5520 
aacgagagca gtgctgactg gcgggaggtg tnggcgcttg tgcaggccgg catccgggat 5580 
ggggtggtac ggcccctcaa gtgcacggtg trccatgggg cccaggtgga ggacgccttc 5640 
cgctacatgg cccaagggaa gcacattggc aaagtcgtcg tgcaggtgcL tgcggaggag 5700 
ccggaggcag tggctgaagg gggccaaacc caagctgatg tcggccatct ccaagaccct 5760 
ctgcccggcc cacaagagct acatcatcgc tggtggtctg ggtggcttcg gcctggagtt 5820 
ggcgcagtgg ctgatacagc gtggggtgca gaagctcgtg ttgacttctc gccccgggat 5880 
ccggacaggc taccaggcca agcaggtccg ccggtggagg cgccagggcg tacaggtgca 5940 
ggtgtccacc agcaacatca gctcactgga gggggcccgg ggcctcattg ccgaggcggc 6000 
gcagcttgag gcccgtgggc ggcgtcttca acctggccgt ggtcttgaga gatggcttgc 6060 
tggagaacca gaccccagag ttcttccagg acgtctgcaa gcccaagtac agcggcaccc 6120 
tgaacctgga cagggtgacc cgagggcgtg ccctgagctg gactactttg tggtcttctc 6180 
ctctgtgagc tgcgggcgtg gcaatgcggg acagagcaac tacggctttg ccaatttccg 6240 
ccatggagcg tatctgtgag aaacgccggc acgaaggcct cccaggcctg gccgtgcagc 6300
ggggcgccat cggcgacgtg ggcattttgg tggagacgat gagcaccaac gacacgatcg 6360
tcagtggcac gctgccccag cgcatggcgt cctgcctgga ggtgctggac ctcttcctga 6420 
accagcccca catggtcctg agcagctttg tgctggctga gaaggctgcg gcctataggg 6480
acagggacag ccagcgggac ctggtggagg ccgtggcaca catcctgggc atccgcgact 6540 
tggctgctgt caacctggac agctcactgg cggacctggg cctggactcg ctcatgagcg 6600 
tggaggtgcg ccagacgctg gagcgtgagc tcaacctggt gctgtccgtg cgcgaggtgc 6660 
ggcaactcac gctccggaaa ctgcaggagc tgtcctcaaa ggcggatgag gccagcgagc 6720 
tgggcatgcc ccacgcccaa ggaggatggt ctggcccagc agcagactca gctgaacctg 6780 
cgctccctgc tggtgaaccc ggagggcccc accctgatgc ggctcaactg ccgtgcagag 6840 
ctcggagcgg cccctgttcc tggtgcaccc aattcgaggg ctccaccacc gtgttccaca 6900 
gcctggcctc ccggctcagc atccccacct atggcctgca gtgcacccga gctgcgcccc 6960 
ttgacagcat ccacagcctg gctgcctact acatcgactg catcaggcag gtgcagcccg 7020 
agggccccta ccgcgtggcc ggctactcct acggggcctg cgtggccttt gaaatgtgct 7080 
cccagctgca ggcccagcag agcccagccc ccacccacaa cagcctcttc ctgttcgacg 7140 
gctcgcccac ctacgtactg gcctacaccc agagctaccg ggcaaagctg accccaggct 7200 
gtgaggctga ggctgagacg gaggccatac gcttcttcgt gcagcagttc acggacatgg 7260
agcacaacag ggtgctggag gcgctgctgc cgctgaaggg cctagaggag cgtgtggcag 7320 
ccgccgtgga cctgatcatc aagagccacc agggcctgga ccgccaggag ctgagctttg 7380 
cggcccggtc cttctactac aagctgcgtg ccgctgagca gtacacaccc aaggccaagt 7440 
accatggcaa cgtgatgcta ctgcgcgcca agacgggtgg cgcctacggc gaggacctgg 7500
gcgcggacta caacctctcc caggtatgcg acgggaaagt atccgtccac gtcatcgagg 7560 
gtgaccaccg cacgctgctg gagggcagcg gcctggagtc catcatcagc atcatccaca 7620
gctccctggc tgagccacgc gtgagcgtgc gggagggcta ggcccgtgcc cccgcctgcc 7680
accggaggtc actccaccat ccccacccca tcccacccca cccccgccat gcaacgggat 7740 
tgaagggtcc tgccggtggg accctgtccg gcccagtgcc actgcccccc gaggctagct 7800 
agacgtaggt gttaggcatg tcccacccac ccgccgcctc ccacggcacc tcggggacac 7860 
cagagctgcc gacttggaga ctcctggtct gtgaagagcc ggtggtgccc gtgcccgcag 7920 
gaactggggc tgggcctcgt gcgcccgtgg ggtctgcgct tggtctttct gtgcttggat 7980 
ttgcatattt attgcattgc tggtagagac ccccaggcct gtccaccctg ccaagactcc 8040 
tcaggcagcg tgtgggtccc gcactctgcc cccatttccc cgatgtcccc tgcgggcgcg 8100 
ggcagccacc caagcctgct ggctgcggcc ccctctcggc caggcattgg ctcagcccgc 8160 
tgagtggggg gtcgtgggcc agtccccgag gactgggccc ctgcacaggc acacagggcc 8220 
cggccacacc cagcggcccc ccgcacagcc acccgtgggg tgctgccctt atgcccggcg 8280 
ccgggcacca actccatgtt tggtgtttgt ctgtgtttgt ttttcaagaa atgattcaaa 8340 
ttgctgcttg gattttgaaa tttactgtaa ctgtcagtgt acacgtctgg accccgtttc 8400 
atttttacac caatttggta aaaatgctgc tctcagcctc ccacaattaa accgcatgtg 8460 
atctccaaaa 8470 

<210> 11 

<211> 812 

<212> DNA 

<213> Homo sapiens 

<400> 11 

gccgcagcca atcagcgcgc gtgcccgggc ccctgcgtct cttgcgtcaa gacggccgtg 60 
ctgagcgaat gcaggcgact tgcgagctgg gagcgattta aaacgctttg gattcccccg 120 
gcctgggtgg ggagagcgag ctgggtgccc cctagattcc ccgcccccgc accccatgag 180 
ccgaccctcg gctccatgga gcccggcaat tatgccacct tggatggagc caaggatatc 240 
gaaggcttgc tgggagcggg aggggggcgg aatctggtcg cccactcccc tctgaccagc 300
cacccagcgg cgcctacgct gatgcctgct gtcaactatg cccccttgga tctgccaggc 360
tcggcggagc gccaaagcaa tgccacccat gccctggggt gccccagggg acgcccccag 420 
ctcccgtgcc ttatggttac tttggaggcg ggtactactc ctgccgagtg tcccggagct 480 
cgctgaaacc ctgtgcccag gcagccaccc tggccgcgta ccccgcggag actcccacgg 540 
ccggggaaga gtaccccagc cgccccactg agtttgcctt ctatccggga tatccgggaa 600 
cctaccagcc tatggccagt tacctggacg tgtctgtggt gcagactctg ggtgctcctg 660 
gagaaccgcg acatgactcc ctgttgcccg tggacagtta ccagtcttgg gctctcgctg 720 
gtggctggaa cagccagatg tgttgccagg gagaacagaa cccaccaggt cccttttgga 780 
aggcagcatt tgcagactcc agcgggcagc ac 812 

<210> 12 

<211> 2385 

<212> DNA 

<213> Homo sapiens 

<400> 12 

ataagccggg gtaaagtatt ttcgcagttc ctgcctttag gattttatta gcttctctcc 60 
cccaggccgc agccaatcag cgcgcgtgcc cgggcccctg cgtctcttgc gtcaagacgg 120 
ccgtgctgag cgaatgcagg cgacttgcga gctgggagcg atttaaaacg ctctggattc 180
ccccggcctg ggtggggaga gcgagctggg tgccccctag attccccgcc cccgcacctc 240 
atgagccgac cctcggctcc atggagcccg gcaattatgc caccttggat ggagccaagg 300
atatcgaagg cttgctggga gcgggagggg ggcggaatct ggtcgcccac tcccctctga 360
ccagccaccc agcggcgcct acgctgatgc ctgctgtcaa ctatgccccc ttggatctgc 420 
caggctcggc ggagccgcca aagcaatgcc acccatgccc tggggtgccc caggggacgt 480 
ccccagctcc cgtgccttat ggttactttg gaggcgggta ctacccctgc cgagtgtccc 540 
ggagctcgct gaaaccctgt gcccaggcag ccaccctggc cgcgtacccc gcggagactc 600 
ccacggccgg ggaagagtac cccagccgcc ccactgagtt tgccttctat ccgggatatc 660 
cgggaaccta ccagcctatg gccagttacc tggacgtgtc tgtggtgcag actctgggtg 720 
ctcctggaga accgcgacat gactccctgt tgcctgtgga cagttaccag tcttgggctc 780 
tcgctggtgg ctggaacagc cagatgtgtt gccagggaga acagaaccca ccaggtccct 840 
tttggaaggc agcatttgca gactccagcg ggcagcaccc tcctgacgcc tgcgcctttc 900 
gtcgcggccg caagaaacgc attccgtaca gcaaggggca gttgcgggag ctggagcggg 960 
agtatgcggc taacaagttc atcaccaagg acaagaggcg caagatctcg gcagccacca 1020 
gcctctcgga gcgccagatt accatctggt ttcagaaccg ccgggtcaaa gagaagaagg 1080 
ttctcgccaa ggtgaagaac agcgctaccc cttaagagat ctccttgcct gggtgggagg 1140 
agcgaaagtg ggggtgtcct ggggagacca ggaacctgcc aagcccaggc tggggccaag 1200 
gactctgctg agaggcccct agagacaaca cccttcccag gccactggcr gctggactgt 1260 
tcctcaggag cggcctgggt acccagtatg tgcagggaga cggaacccca tgcgacagcc 1320 
cactccacca gggttcccaa agaacctggc ccagtcataa tcattcatcc tgacagtggc 1380 
aataatcacg ataaccagta ctagctgcca cgatcgctag cctcatattt tctatctaga 1440 
gctctgtaga gcacttcaga aaccgctttc atgaattgag ctaattatga ataaatctgg 1500 
aaggcgatcc ctttgcaggg aagctttctc tcagaccccc ttccattaca cctctcaccc 1560 
tggtaacagc aggaagactg aggagagggg aacgggcaga tccgttgtgt ggctgtgatg 1620
tccgtttagc atttttctca gctgacagct gggtaggtgg acaattgtag aggctgtctc 1680 
ttcctccctc cttgtccacc ccatagggcg tacccactgg tcttggaagc acccatcctt 1740
aatacgatga tttttctgtc gtgtgaaaat gaagccagca ggctgcccct agtcagtcct 1800 
tccttccaga gaaaaagaga tttgagaaag tgcctgggta attcaccatt aatttcctcc 1860 
cccaaactct ctgagtcttc ccttaatatt tctggcggtt ctgaccaaag caggtcatgg 1920 
tttgctgagc attcgggatc ccagtgaagt agatgtttgt agccttgcat acttagccct 1980 
tcccaggcac aaacggagtg gcagagtggt gccaaccctg ttttcccagt ccacgtagac 2040 
agattcacgt gcggaattct ggaagccgga gacagacggg ctccttgcag agccgggacc 2100 
ctgagaggga catgagggcc tctgcctctg tgttcattcx ctgatgtcct gtacctgggc 2160 
tcagtgcccg gcgggactca tctcctggcc gcgcagcaaa gccagcgggt tcgcgctggt 2220 
ccttcctgca ccttaggctg ggggtggggg gcccgccggc gcattctcca cgattgagcg 2280 
cacaggcctg aagtctggac aacccgcaga accgaagctc sgagcagcgg gtcggtggcg 2340
agtagtgggg tcggtggcga gcagttggtg gtgggccgcg gccgc 2385 

<210> 13 
<211> 221 
<212> DNA 

<213> Homo sapiens 
<400> 13 

dsdnrstatc tttctgtgtg gtgcagcccc gttggcagtg ggcatctggg tgtcaatcga 60 
tggggcatcc tttctgaaga tcttcgggcc actgtcgtcc agtgccatgc agtttgtcaa 120 
cgtgggctac ttcctcatcg cagccggcgt tgtggtcttt gctcttggtt tcctgggctg 180 
ctatggtgct aagactgaga gcaagtgtgc cctcgtgacg t 221

<210> 14 
<211> 1533 

<212> DNA
<213> Homo sapiens 

<400> 14 

gggcacgcag acattctggg aagccacttg ccccacccct gggctgcttc ttcttgagat 60 
caggaggggc gttgcccagg gctggtgttg ccaggtggag gcctgctgag gcagtggttg 120 
tggggatcgg tctccaggca gcagggggca gcagggtcaa ggagaggcta actggccacg 180 
ggtggggcca gcaggcgggc agaaggaggc tttaaagcgc ctaccctgcc tgcaggtgag 240 
cagtggtgtg tgagagccag gccgtccctc tgcctgccca ctcagtggca acacccggga 300 
gctgttttgt cctttgtgga gcctcagcag ttccctgctt tcagaactca ctgccaagag 360 
ccctgaacag gagccaccat ggcagtgctt cagcttcatt aagaccatga tgatcctctt 420 
caatttgctc atctttctgt gtggtgcagc cctgttggca gtgggcatct gggtgtcaat 480
cgatggggca tcctttctga agatcttcgg gccactgtcg tccagtgcca tgcagtctgt 540
caacgtgggc tacttcccca tcgcagccgg cgttgtggtc tttgctcttg gtttcctggg 600 
ctgctatggt gctaagactg agagcaagtg tgccctcgtg acgttcttct tcatcctcct 660 
cctcatcttc attgctgagg ctgcagctgc tgtggtcgcc ttggtgtaca ccacaatggc 720 
tgagcacttc ctgacgttgc tggtagtgcc tgccatcaag aaagattatg gttcccagga 780 
agacttcact caagtgtgga acaccaccat gaaagggctc aagtgctgtg gcttcaccaa 840 
ctatacggat tttgaggact caccctactt caaagagaac agtgcctttc ccccattctg 900 
ttgcaatgac aacgtcacca acacagccaa tgaaacctgc accaagcaaa aggctcacga 960 
ccaaaaagta gagggttgct tcaatcagct tttgtatgac atccgaacta atgcagtcac 1020 
cgtgggtggt gtggcagctg gaattggggg cctcgagctg gctgccatga ttgtgtccat 1080 
gtatctgtac tgcaatctac aataagtcca cttctgcctc tgccactact gctgccacat 1140 
gggaactgtg aagaggcacc ctggcaagca gcagtgattg ggggagggga caggatctaa 1200 
caatgtcact tgggccagaa tggacctgcc ctttctgctc cagacttggg gctagatagg 1260 
gaccactcct tttaggcgat gcctgacttt ccttccattg gtgggtggat gggtgggggg 1320 
cattccagag cctctaaggt agccagttct gttgcccatt cccccagtct attaaaccct 1380
tgatatgccc cctaggccta gtggtgatcc cagtgctctia ctgggggatg agagaaaggc 1440 
attttatagc ctgggcataa gtgaaatcag cagagcctct gggtggatgt gtagaaggca 1500 
cctcaaaatg cataaacctg ttacaatgtt gcc 1533 

<210> 15 

<211> 472 

<212> DNA 

<213> Homo sapiens 

<400> 15 

tcagagaaaa ctcaaacttt attgagagaa ttttcaaatt ttcagtcaca ttttcaatgt 60 
gacatcagcc atgtgtgtag cttcagcttg tcttcttttt aacttatggc tgcccatctc 120
ctgcttcttt agtcttagca tgcttaggat taggtggagt cttctctttt acatcagagc 180 
catctccacg ctcactccga gtcttttcca gatccatttc ctggcaatca ccttctactt 240 
tacgttcttc gatcggaggt gttccttctc tctcctgtcc aggttcaata tcctgattgt 300 
cagttggtgg ttcctcttgc tgagattcac cgggagccac gaatgcaacc acatcgggag 360 
cctcctgacc atctcctctt cctctggatc ttgatctcac tcgtgcactc atcgctgcaa 420 
ctagaagatc gtgaactgaa gaacttgagt cagcagagag cctggcgaag aa 472

<210> 16 

<211> 478 

<212> DNA 

<213> Homo sapiens 



<400> 16

cttcattctt cgccaggctc tctgctgact caagttcttc agttcacgat ctcctagttg 60
cagcgatgag tgcacgagtg agatcaagat ccagaggaag aggagatggt caggaggctc 120
ccgatgtggt tgcattcgtg gctcccggtg aatctcagca agaggaacca ccaactgaca 180
accaggatat tgaacctgga caagagagag aaggaacacc tccgatcgaa gaacgtaaag 240
tagaaggtga ttgccaggaa atggatctgg aaaagactcg gagtgagcgt ggagatggct 300
ctgatgtaaa agagaagact ccacctaatc ctaagcatgc taagactaaa gaagcaggag 360
atgggcagcc ataagttaaa aagaagacaa gctgaagcta cacacatggc tgatgtcaca 420
ttgaaaatgt gactgaaaat ttgaaaattc tctcaataaa gtttgagttt tctctgaa 478

<210> 17
<211> 198
<212> DNA

<213> Homo sapiens

<220>

<221> unsure
<222> (191)

<400> 17

cccgctgtac caccccagca tgttctgcgc cggcggaggg caagaccaga aggactcctg 60
caacggtgac tctggggggc ccctgatctg caacgggtac ttgcagggcc ttgtgtcttt 120
cggaaaagcc ccgtgtggcc aagttggcgt gccaggtgtc tacaccaacc tctgcaaatt 180
cactgagtgg nattaagg 198

<210> 18
<211> 465
<212> DNA

<213> Homo sapiens
<400> 18

tggagatgga gtatgtattt attttacaaa aataaatcac catcttcgga ccatttgtag 60
actggaacat ttcgagcaat gagtgcgcca cacggacgag tgccctggtg actccctgat 120
gttcgcgtca cccccagggc caccttggcg cccgcatgag cctcgcttcc cactcccggc 180
ctccaactcc cttccctcgc agccgccatt caccttctgc tgtttatttg tctgcagagc 240
gcctggacac cggaaaaggc gattccctga gcgcctggag ttggagacaa ttcctggttc 300
agaatttaaa catctttcta aggtaagcgc tgctccaaaa ctcttcgccg cgtggggact 360
ttgcaccagg ggcggttggg aaggaagttg gccctccacg ggttcctggg caaccgcggc 420
ctgttgaaaa aaggttctgg gtcaaataat ttaacttcgg aggag 465



<210> 19 
<211> 204
<212> DNA

<213> Homo sapiens
<400> 19

ggcgggaaca ggcggcgctg gacctgtacc cctacgacgc cgggacggac agcggcttca 60
ccttctcctc ccccaacttc gccaccatcc cgcaggacac ggtgaccgag ataacgtcct 120
cctctcccag ccacccggcc aactccttct actacccgcg gctgaaggcc ctgcctccca 180
tcgccagggt gacactggtg cggc 204

<210> 20
<211> 294
<212> DNA

<213> Homo sapiens

<220>

<221> unsure
<222> (287)

<400> 20

gagatttctc ttcaatggct tcctgtgagc tagagtttga aaatatctta aaatcttgag 60
ctagagatgg aagtagcttg gacgattttc attatcatgt aaaccgggtc actcaagggg 120
ccaaccacag ctgggagcca ctgctcaggg gaaggttcac atgggacttt ctactgccca 180
aggttctata caggatataa aggtgcctca cagtatagat ctggtagcaa agtaagaaga 240
aacaaacact gatctctttc tgccacccct ctgacccttt ggaactnctc tgac 294

<210> 21
<211> 22
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: Synthetic
<400> 21

atcagaacaa agaggctgtg tc 22

<210> 22 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Synthetic 
<400> 22 

atctctaaag ccccaacctt c 21 

<210> 23
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Synthetic 
<400> 23 

tgccgaagag gttcagtgc 19

<210> 24 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Synthetic 
<400> 24 

gccacagtgg tactgtccag at 22

<210> 25 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Synthetic 
<400> 25 

gctgcaagtt ctccacattg a 21

<210> 26 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Synthetic 
<400> 26 

cagccgcagg tgaaacac 18

<210> 27 
<211> 20 
<212> DNA 

<213> Artificial Sequence 

<220>

<223> Description of Artificial Sequence : Synthetic 
<400> 27 

tggctttgaa ctcagggtca 20

<210> 28 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Synthetic 
<400> 28 

cggatgcacc tcgtagacag 20

<210> 29 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Synthetic 
<400> 29 

cggcaacctg gtagtgagtg 20

<210> 30 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Synthetic 
<400> 30 

cgcagctcct tgtaaacttc ag 22

<210> 31 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Synthetic 

<400> 31

cgggaaccta ccagcctatg 20



<210> 32 

<211> 20
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 32 

caggcaacag ggagtcatgt 20 

<210> 33
<211> 18
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: Synthetic
<400> 33

tgggcatctg ggtgtcaa 18

<210> 34
<211> 19
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: Synthetic
<400> 34

cggctgcgat gaggaagta 19

<210> 35
<211> 22
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: Synthetic
<400> 35

gcccatctcc tgcttcttta gt 22



<210> 36
<211> 21
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Synthetic 
<400> 36 

cgtggagatg gctctgatgt a 21